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Call for Convention Papers 
MEMBERS AND FRIENDS OF THE AES: 


1958 - 


71743 = 19 


TEN years that have seen the emergence and development of the 
Audio Engineering Society to a position of importance and prestige in 
the engineering field. 


Do YOU realize that YOU can play a vital role in continued Audio 
progress? YOU can, by personally participating in this year’s Conven- 
tion, and this participation includes your papers, or suggestions for 
papers to the Convention Committee. 


The Annual AES Convention has played a vital role in increasing the 
prestige and position of the AES. Last year over 60 outstanding papers 
were presented. Each one was a worthwhile contribution to the field 
of Audio Engineering. This year, we hope to make the 10th Annual 
Convention an outstanding event in AES history. 


As usual, we wish to encompass a wide range of audio engineering 
fields. Some suggested subjects are: 


Artificial Reverberation Quality Control Stereo Disc-Recording 
Audio Applications Recording and Reproduction Stereo Studio Techniques 
Electronic Music Calibrations Studio Acoustics, Stereo 
Fine Groove Recording Speaker Systems, Monaural Studio Design for Stereo 
New Record Manufacturing and Stereo Tape Duplication Advances 
Techniques Standards for Home Audio Tape Recording—New Uses 
Psycho-acoustics Stereo Cartridge-Design The Transistor in Audio 
| Of course, the listed categories are only a portion of the possible sub- 


jects for papers presentation. 


The subjects presented at the Convention, which is to be held in the 
Fall, will, as usual, be determined by YOU. 


Just send to the Convention Committee a title and 25 word summary 


of a paper you wish to present personally this year at the Convention. 
Or, submit your recommendations for subjects you wish presented and 
possible authors of these papers. 


Please send all suggestions and titles to Convention Committee, P. O. 
Box 12, Old Chelsea Station, New York 11, New York. 


Donald Plunkett, Convention Chairman 
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JOURNAL OF THE AUDIO ENGINEERING SOCIETY 


APRIL 1958, VOLUME 6, NUMBER 2 


Editorial 


This issue consists entirely, with one exception, of papers 
dealing with stereophonic sound. Many were initially pre- 
sented at the highly successful stereo session of the last 
Convention. A cursory scan of their texts makes apparent 
the varying levels of sophistication at which this subject 
is treated. This merely reflects the fact that while experi- 
ment and discovery are still in progress, a substantial corpus 
of achievement now exists. As the opening paper by Offen- 
hauser informs us, preoccupation with stereophonic problems 
began with the first demonstrations of the telephone, the 
first electrical sound reproducing device. Although solu- 
tions to these problems were sought in the years since, the 
impetus was supplied principally by their technical and 
military applications, and the increase in information which 
a second channel could contribute was not deemed of suffi- 
cient commercial importance. Only at this late date has its 
exploitation aroused enough interest to engage a substantial 
segment of the audio entertainment industry, and we now 
see considerable engineering and advertising talent devoted 
to its development and popularization. 

The present interest in stereo is without doubt a normal 
and natural advance in the evolution of so-called high 
fidelity sound reproduction. It has however been accelerated 
during this postwar decade by the remarkable mass enthusi- 
asm for high quality reproduction, the rapid development 
of new materials and new electronic techniques, and above 
all by the coming to maturity of a new generation of audio 
engineers. Perhaps it will not be too much to hope that 
these factors—fresh ideas, new blood, and more money— 
will be more effective in closing the temporal gap between 
invention and commercial product. 

The emergence of audio as a mass preoccupation, with 
components mass-produced and mass-consumed, is not with- 
out a concomitant penalty: built-in obsolescence. A glance 
at some products offered for sale today indicates instances 
where engineering has been a secondary consideration. From 
a somewhat different viewpoint, the exploitation of stereo 
reflects the same type of economic necessity. If one is to 
judge from exhibits at the last Audio Fair, outside of stereo, 


no significantly new developments in audio have occurred in 
the past year. The advent of stereo therefore creates new 
products, new sales, renewed consumer unhappiness with 
what he purchased last year, and we hope, more work for 
audio engineers. 

The state of “perfection” in which stereo now finds itself 
leaves a great deal to be desired. To be-sure, after the 
novelty has worn off, what remains will be recognized as an 
obvious improvement in auditory quality. From an engineer- 
ing viewpoint, however, serious shortcomings are too evi- 
dent, especially in the products for mass consumption. 
Agreement is lacking, for example, on the quantitative speci- 
fication of such elementary parameters as interchannel 
crosstalk, permissible distortion, and channel geometry, to 
name only three. There ought to be concerted effort and 
research on the part of the profession and the organizations 
which they represent to pin down those elusive factors which 
make stereo such an improvement. In so doing the setting 
up of standards will be enabled (it is to be hoped), with a 
consequent uplift in engineering quality and morale. If dis- 
cussions of the sort represented by the “Stereo Tape Sym- 
posium,” which appears in these pages have any value in 
attaining these objectives, then they should occur in pro- 
fusion. 

At this point the proper question to be asked is: “After 
stereo, what?” The answers, at best speculative, should 
probably be left to a later date. I should however like to 
conclude by pointing out that progress in audio may have 
this much in common with progress in fields such as space 
travel and nuclear energy: all require a stockpile of pure 
research. And unless, to reiterate, considerable engineering 
effort is continuously applied to this type of inquiry, instead 
of to the development of ultimately meaningless “gimmicks,” 
stagnation will set in. 

G. Hortucnut 


(The Editor alone assumes responsibility for this page. The opin- 
ions expressed here do not necessarily reflect the viewpoint of the 
Society.) 
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JOURNAL OF THE AUDIO ENGINEERING SOCIETY 


APRIL 1958, VOLUME 6, NUMBER 2 


Binaural and Stereophonic Sound” 
In Retrospect 


WIiLuiaM H. OrrenHAuseR, JR. 
New Canaan, Connecticut 


a PAPERS that appear in this issue are milestones 

in the development of the art of sound reproduction. 
When, two decades ago, Maxfield described the sound mo- 
tion picture as the application of engineering equipment to 
art, his definition was intentionally drafted in sufficiently 
broad terms to encompass such art, wherever it might be 
found—in the theatre, in business, in universities and mu- 
seums, and last but not least, in the home. 

In terms of the human experience called culture, art is 
old and engineering equipment is new. Buried in the yel- 
lowed pages of dusty books are many ideas, suggestions and 
concepts that even now await suitable engineering equip- 
ment to transport them from the realm of speculation and 
even fantasy into everyday reality even as artificial satel- 
lites have so moved. 

Soon after the telephone was invented, before the turn of 
the century, in 1881, an experiment was performed at the 
Paris Opera in which two separate channels were used, each 
consisting of a telephone transmitter, an interconnecting 
wire and a telephone receiver. History records that it was 
quite logical at the time’ to place one receiver at one ear 
and the other at the second ear. There was some “puzzle- 
ment” as to what to do with two transmitters. Failing to 
find a pragmatic single channel input arrangement that 
appeared to be logical in terms of the hardware and its 
interconnection, which arrangement would suitably produce 
adequate spatial impressions, the unhappy investigators 
continued to use the two transmitters. Seemingly they 
were never really satisfied with that arrangement; after all, 
human beings may be blessed with two ears, but they do 
have only one mouth! 

A little reading of such journals as Physical Review and 
the like in the period between roughly 1880 and 1930 shows 
that quite a number of erudite persons (such as a former 
President of Yale University) were fascinated by the ap- 
parent contradictions of logic implied by the various plausi- 
ble theories of the time and the practical test results with 
people. All investigators seemed to agree explicitly or im- 


* Received Nov. 3, 1957. Revised manuscript received March 5, 
1958. 
1 Lewis Carroll published Symbolic Logic in 1897. 
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plicitly that the difficulty rested in their inability to “get 
inside the head” of the subject under test. In other lan- 
guage, if the sensing device, the ear, has unknown charac- 
teristics, upon what basis should an investigator measure 
the stimulus presented to the ear which measure will have 
significance in terms of the overall performance of the 
system which includes the mysterious black box that senses 
what is presented? G. W. Stewart of Iowa State University, 
among others, spent the better part of a very productive 
lifetime studying the characteristic of the input test signal 
to the ear in order to provide clues to the dynamic param- 
eters of the puzzling black box. His papers in the Physical 
Review from 1912 to 1920 are especially worthy of study. 
Harvey Fletcher’s work is likewise worthy even though he 
concentrated upon steady tones while most ordinary sounds 
are transients. 

In Great Britain the pragmatic approach was pursued by 
A. Rosenberg who filed a patent application for two-channel 
recording and reproduction on October 25, 1911 which was 
issued as British No. 23,620. Rosenberg’s clarity is re- 
markable; it could well serve as a model for all who follow. 
Seemingly he followed the lead of the French since he used 
two separate microphones to record two separate sound 
tracks. However, the enigma of “two ears—one mouth” 
remained. 

Although little has been published as yet, it is known 
that much progress in acoustic localization was made prior 
to and during World War I with radio-controlled and 
sonically-controlled surface and sub-surface torpedoes, and 
with hydrophones for the location and detection of ships, 
submarines and torpedoes. The record available so far 
hints strongly that homing devices were studied and that 
important progress was made even at that early date. A 
significant weakness was the lack of reliability in electron 
tubes which were then available in rather small quantity, 
rather than the lack of know-how. 

By 1924 when Kuechenmeister filed his British patent on 
the delay effect, the Research Department of the Western 
Electric Company (now the Bell Telephone Laboratories) 
had already published numerous papers in the Bell System 
Technical Journal on the subject of telephone transmission. 
Otto Zobel, for example, published “Theory and Design of 
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Uniform and Composite Wave Filters” (pp. 1 to 46) in 
January, 1923 and followed it with “Transmission Charac- 
teristics of Electric Wave Filters” (pp. 567 to 620) in Octo- 
ber, 1924. In the latter issue, John R. Carson published 
“The Building-up of Sinusoidal Currents in Long Periodi- 
cally Loaded Lines.” Zobel’s 1924 paper lists Campbell’s 
November 1922 paper on wave-filters as a reference. 

The Kuechenmeister patent which applied to binaural 
recording is British No. 258,864 which had a Convention 
date of September 22, 1925. He described the delay effect 
as a phase difference and stated it as “—about 1/30 to 1/8 
second.” Olson’s curve given in Fig. 7 of his paper in this 
issue shows that Kuechenmeister’s lower limit is on Olson’s 
curve. If the slope of the curve is assumed to follow the 
trend indicated, it seems likely that the 1/8 second value 
will also be confirmed. 

In 1934 Harvey Fletcher and others of the Bell Labora- 
tories published “Auditory Perspective” in Electrical Engi- 
neering. These papers were the kick-off for an intensive 
effort to bring to the attention of the motion picture indus- 
try and the public the stereophonic accomplishments al- 
ready available. In a typical demonstration, a live orchestra 
would perform one-half of a selection and the Bell Labora- 
tories equipment the other half—and not even the break at 
the changeover revealed the fact that engineering equipment 
was performing while the musicians were merely doing a 
pantomime. Compression and expansion using control 
tracks (such as described by the patent to Sacia of Bell 
Laboratories in 1924) had been incorporated so skillfully 
with so little perceptible distortion that not a trace of back- 
ground noise remained at any part of the performance to 
reveal the presence of mechanism. Motion picture film 
running at 90 feet per minute (with variable area sound 
track) performed the wonders. 

In 1934, also, Norman MacLachlan of London published 
his outstanding book Loudspeakers. Even now, that text 
appears to be a rigorous and complete mathematical analysis 
of the performance of loudspeakers, the most compact pre- 
sentation to date. Even the now-popular extended source 
devices called electrostatic loudspeakers received the same 
thorough treatment as the conventional cone and horn 
loudspeakers then popular. In the author’s own view, the 
physical embodiment of principles may change and rapidly, 
but the principles, if properly and fully stated mathemati- 
cally, are as ageless as time itself. 

In these remarkable demonstrations the “two ears—one 
mouth” enigma remained; a middle channel had been added 
to two-side (left and right) channels to reduce the bowing 
effect at the center which Paul Klipsch mentions in this 
issue. Thus there were three ears and three mouths as well 
as two ears and two mouths—and arrangements for adapt- 
ing two to three and three to two. Since space consists of an 
infinity of points (a point in space is dimensionless), there 
were disclosures of multiplicities of such things as micro- 
phones and loudspeakers even to the point where it became 
reasonable to ask whether there were sufficient interstices 


WILLIAM H. OFFENHAUSER, JR. 


to allow for performers in a studio or observers in an audi- 
ence! But the sales statistics fail to disclose a herculean 
jump in microphone and loudspeaker sales in the years that 
immediately followed even though Fantasia did use multi- 
plicities of speakers in some theatres where it was shown. 

From 1931 to 1936 suggestions were made in patents 
(Beers, Halstead, Hammond, Jones, Doolittle, Patterson, 
Douden) that it is possible to move a virtual source of 
sound produced by loudspeakers by means of circuitry and 
hardware when the real event occurred without such move- 
ment. Jones, for example, may be said to have disclosed 
“the wiggling of microphone ears in a dummy head.” Such 
heads, being dummies, have little sense; they seem unable 
to distinguish front from rear. None of the inventors dealt 
with the “two ears—one mouth” enigma. 

In 1937 Stevens and Davis published “Hearing,” a much- | 
needed textbook describing the contents and performance of | 
the black box. We were informed that the bundles of fre- | 
quency overlapping aural nerves which seemingly are of | 
lengths and diameters proportional to piano strings fire like ] 
thyratrons when stimulated, and have recovery times in the | 
order of several cycles of the tone that causes the particular 
nerve to fire. Von Békesy reported upon the cochlear re- 
sponses of the cat, and we were well started on the way 
toward understanding the black box that we call the ear. 

Hilliard (1938) observed that, in a two-way loudspeaker 
system in motion picture theatres, if the woofer is out of 
phase with the Fletcher horn (a Western Electric multi- 
cellular straight horn used for the upper register), the sound 
jumps away from the loudspeaker and the polarity of the 
voice coil leads of either woofer or horn driver must be 
reversed to “bring it back where it belongs.” Equipment 
servicemen in the theatre developed “golden ears”—a horn 
was phased with respect to the heavy woofer by sliding it 
forward or rearward with respect to the face of the woofer 
above which it was mounted until sounds in the 400-cycle 
crossover region seemed free of audible distortion. In one 
aspect, that distortion is similar to telephone line echo 
which occurs with mismatched impedances. When speakers 
were improperly phased, the sound produced by the inter- 
ference was very harsh, unpleasant and irritating. The 
problem of matching the radiation of a plane wave front 
to that of a spherical wave front received a pragmatic solu- 
tion which satisfied multitudes of movie fans—if the bulging 
box offices of the nation can be taken as a criterion. 

When Sound Motion Pictures was published by the Acad- 
emy of Motion Picture Arts and Sciences in 1938, Hilliard 
reported upon some delay measurements in a two-stage 
amplifier (wide range) that are worthy of mention. With 
53-db gain, the transit time for a 400-cps signal from input 
to output is given as 0.1 millisecond while the value at 40 
cps increased to 4.2 milliseconds. If one thinks of a repro- 
ducing system as the equivalent of a single mechanical 
device, maximum energy efficiency (power factor) should 
be found at the center of the frequency band and poorer 
power factors (significant phase shift) at the range extremi- 
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ties. With this thought in mind, is it reasonable to think 
that a 40-to-1 time delay ratio is meaningless? Olson’s 
data in this issue of the JouRNAL would seem to be saying 
“no” despite the fact that Hilliard reported the widely held 
belief of 1938 that a delay of as much as 75 milliseconds 
at 30 cps, such as might occur with several low pass filters 
in tandem, did not seem to make much difference in per- 
ceived sound quality in a single-channel wide range system. 

In October 1938 Offenhauser and Israel demonstrated a 
single channel binaural system to the Society of Motion 
Picture Engineers at the Detroit Convention; the paper was 
“Some Production Aspects of Binaural Recording for Sound 
Motion Pictures” published in JSMPE, February, 1939, 
pages 139-155. The last six pages list bibliography and 
patents mentioned in this resumé, and include more words 
(in fine print) of discussion than were written in the paper 
proper. A reading of that discussion by one familiar with 
the art of today points up some of the popular misconcep- 
tions of the time, many of which still persist. To the writ- 
er’s knowledge, this was the first public demonstration of a 
solution to the “two ears—one mouth” enigma that had 
puzzled so many in the past—and which still continues to 
puzzle. That paper showed that sounds may be moved up- 
ward and downward as well as backward and forward and 
right to left. At one point in the demonstration, with two 
Langevin loudspeakers placed side by side, sounds appeared 
to originate at will off to one side at an angle of some 30° 
to the left of the loudspeaker projection axis and at an angle 
of 45° above the horizontal—far beyond the limits of the 
loudspeakers. U. S. Patent No. 2,246,593, dated 24 June 
1941, described this and other interesting characteristics of 
binaural systems. Dr. Schroeder’s paper in this issue ex- 
hibits mathematics and circuitry that seem pertinent. 

By the time World War II occurred, the submarine had 
become an important weapon, and much midnight oil was 
burned to improve the methods of detection and ranging. 
Again, it is unfortunate that details are not available, as 
many facts that must have been learned in sonar study 
would probably have application to binaural and stereo- 
phonic sound. 

We must, of course, mention the stereophonic sound 
tracks in recent years of Hollywood and like films such as 
Cinerama, Cinescope, Todd-AO, and the many varieties of 
pan-pot techniques that are the everyday practice in film 
production. We cannot ignore control tracks which started 
with Sacia and which are used for switching as well as for 
expansion and compression control. The art is too extensive 
to justify more than mere mention here; a thorough paper 
would be required as a minimum. 

For years the process of binaural fusion has been a mys- 
tery. Year after year new facts have been slowly extracted 
—nature does not yield her secrets to the faint-of-heart. In 
the September 1957 issue of the Journal of the Acoustical 
Society of America, Sayers and Cherry discuss the “Mech- 
anism of Binaural Fusion in the Hearing of Speech.” Al- 
though that paper does not lift the curtain very far on how 


binaural fusion occurs in music, it does set forth quite 
lucidly the currently popular comb-filter concept for speech 
such as Dr. Schroeder has generalized in his paper. 

Within the last two or three years, two-channel tapes and 
disks have become available for use in the home; seemingly, 
home binaural (stereophonic) usage presently presumes two 
ears possibly for the reason that mater familias would put 
her pretty foot down if her husband should wish to install 
multiplicities of loudspeakers all over the house. Could it 
be that Martin Mayer in “1957 Edition Hi-Fi” is correct 
in stating: 

“It may be said that the theory of binaural sound is in 
large part meaningless.”’? 

But Mayer’s dictum is true only of those who do not 
know the theory of binaural sound and to whom it is there- 
fore, meaningless. To those who know the details of the 
history that is recorded in these pages, the theory of binaural 
sound, like the theory of man-made satellites, is another 
challenge to man to discover the secrets of the environment 
within which he lives. Who can doubt that this too will 
become an open book if the Journat of this Society and 
fellow Societies continue to publish papers as revealing as 
those which appear in the pages that follow? 


APPENDIX 1. Some U. S. Patents of Interest. 
Wier—1,508,432 
Sacia—1 623,756 
Doolittle—1,817,177, 2,126,370 
Jones—1,817,489, 1,855,147, 1,855,148, 1,855,150, 1,855,- 
151, 1,944,182 
Maxfield—2,019,615, 2,019,616 
Blumlein—2 093,640, 2,098,372 
Israel—2 246,593 
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A Compatible System of Stereo Transmission 
by FM Multiplex * 


Murray G. Crossy 
Crosby Laboratories, Inc., Hicksville, N. Y. 


A method of applying modulation to the channels of a stereo FM multiplex transmitter is 
described which provides a “compatible” reception for the monaural listener. By transmitting 
an additive combination of the microphones on the main channel of the FM transmitter and a 
subtractive combination on the sub-carrier channel, the main channel transmits a balanced pro- 
gram without the deterioration which would be brought about if the output of only one micro- 


phone were transmitted on the main channel. 


At the receiver the combinations are separated so 


that the microphones feed the individual loudspeakers directly. 

Additional advantages of the system include a 6 db signal-to-noise gain for the sub-carrier 
channel. This gain is highly desirable because the relatively poor signal-to-noise on the sub- 
carrier channel curtails the distance range of the stereo transmission. In addition, the signal-to- 
noise ratio appearing in the receiver channel is made identical so that disturbances due to noise 
appear symmetrically in both loudspeakers. This results in less annoyance due to noise. 

A discussion of relative signal-to-noise ratios on the main channel and sub-carrier channels is 


given, together with recommendations for optimum performance of a stereo transmission. 


INTRODUCTION 


- IS THE OBJECT of this paper to describe an im- 

proved method of transmitting stereophonic sound on an 
FM transmitter with many advantages for both the mon- 
aural and stereo listener. An outstanding advantage is 
obtained for the monaural listener who is equipped to re- 
ceive only the main channel without the benefit of stereo 
reception. With the system to be described, that listener 
has a compatible reception with the program properly bal- 
anced with respect to microphone pickup. The advantages 
obtained for the stereo listener include a 6-db improvement 
in signal-to-noise ratio realized from the relatively poor 
signal-to-noise ratio on the sub-carrier channel, together with 
a balancing of the signal-to-noise ratios on the two channels 
so that a more pleasing response is obtained. 


CONVENTIONAL STEREO FM MULTIPLEX SYSTEM 


Figure 1 shows the conventional FM multiplexing sys- 
tem which has been used to demonstrate stereophonic trans- 
mission by Armstrong and Halstead. In this system micro- 
phone A is fed to the main channel of the FM transmitter 
and microphone B is fed to the sub-carrier generator which 
is applied as a multiplex modulation on the FM wave. 


* Revised manuscript received March 25, 1958. Delivered at the 
Ninth Annual Convention of the Audio Engineering Society, New 
York, October 11, 1957. 


Application of the sub-carrier modulation may also be 
accomplished by the use of an auxiliary phase modulator 
following the main FM modulator with the same basic 
result in the wave transmitted. This transmitting system 
is shown in Fig. 2. It utilizes a form of multiple phase 
modulation,’ applied to successive stages, so that the re- 
sultant modulation is the sum of the separate modulations. 
Use of this principle for sub-carrier modulation is described 
by E. H. Armstrong.” 

The conventional techniques in which the separate micro- 
phone outputs are transmitted over the separate channels, 
as shown in Figs. 1 and 2, have several disadvantages. 
First of these is the lack of compatibility for the monaural 
listener. Since that listener will tune in the main channel 
only, a degraded program pickup is received by virtue of 
the reception of the output due to one microphone only. 
That single microphone will receive its pickup from one 
side of the stage only so that the instruments near it will 
be over-emphasized and those near the other microphone 
will be de-emphasized. The pickup of a soloist who hap- 
pens to be nearest to the opposite microphone will be es- 
pecially degraded. 

The second disadvantage of the conventional system is 
the unbalance of the signal-to-noise ratios on the main and 
sub-carrier channels. The sub-carrier channel has an effi- 


1M. G. Crosby, U. S. Patent No. 2,104,318, Jan. 4, 1938. 
2U.S. Patent No. 2,630,497, March 3, 1953. 
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“9 
SUBCARRIER vee 
= GENERATOR MODULATOR POWER AMP. 
FM AUDIO 
TUNER LIMITER RIMENA AMPLIFIER a 
SUBCARRIER AUDIO B 
RECEIVER [| AMPLIFIER 


Fic. 1. Conventional FM multiplexing system. Channel A is fed 
to the main channel while channel B is fed to the sub-earrier gen- 
erator which is applied as a multiplex modulation on the FM wave. 
At the receiver, the main channel is derived directly from the dis- 
eriminator. The sub-carrier channel requires a sub-earrier receiver, 
with a high-pass filter to remove the main channel, an amplifier, 
limiter and a detector. 


ciency of transmission which is much less than that of the 
main channel so that the signal-to-noise ratio on one loud- 
speaker may be as much as 25 db poorer than that on the 
other. This produces an annoying one-sidedness to noise 
disturbances which impairs the pleasantness of reception. 
With the sum-and-difference technique to be described, the 
signal-to-noise ratios on both loudspeakers are the same 
and the effective transmission of the sub-carrier channel is 
improved by 6 db. The 6-db improvement in sub-carrier 
signal-to-noise ratio is equivalent to a four-to-one increase 
in power of the transmitter since with the stereo system the 
performance is only as good as that of the poorest channel. 
Accordingly the sum-and-difference technique results in an 
increase in the effective range of the stereo transmission. 
This 6-db improvement in signal-to-noise ratio may, under 
certain conditions, be realized as a 6-db improvement in 
overall hum and distortion. If it so happens that the hum 
is predominately from either the sub-carrier channel or the 
main channel taken alone, the result is a 6-db reduction in 
hum. This is brought about by the signals from the sum- 
and-difference combinations adding in phase to produce 


FREQUENCY 
FM FREQUENCY PHASE MULTIPLIER 
- MODULATOR MULTIPLIER MODULATOR = 
POWER AMP. 
SUBCARRIER 
s 
D GENERATOR 


Fig. 2, Another FM multiplex system. This utilizes an auxiliary 
phase modulator which adds a phase modulation of the sub-carrier 
to the main channel. The sub-carrier generator is an FM generator 
with a range between 20 and 75 ke to which is applied the modu- 
lation frequency. 


twice the voltage while the hum remains at a value equal 
to that of the channel with the highest hum. The only con- 
dition in which this improvement would not be realized 
would be that of an equal hum level in both main and 
sub-carrier channels which was so phased as to add up to 
twice the voltage in the sum-and-difference combination. 
This would be a rare coincidence. In the case of distortion, 
the same situation exists; the channel with the lowest dis- 
tortion aids the channel with the highest distortion by a 
factor of 6 db in the overall system. In experimental work 
it has been observed that the sub-carrier channel has the 
least distortion and hum. Under these circumstances the 
sum-and-difference combination is responsible for a 6-db 
reduction in both hum and distortion as well as noise, in 
the overall system response. 


SUM-AND-DIFFERENCE SYSTEM 


Figure 3 shows the sum-and-difference technique of ap- 
plying a twin-channel stereo system to the FM multiplex 


A-B SUBCARRIER 


GENERATOR | 


| A+B 


MODULATOR Pp. 


TRANSMITTER 
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; (A+B) - (A-B) = 28 


SUBCARRIE! T 
RECEIVER | A-B AMP. B 
IL min J LiF 
TUNER ‘ ~s : 
| (A+B) + (A-B) © 2A 
= 
Ass 
RECEIVER 
Fic. 3. Sum-and-difference system. Channels A and B are 
mixed so that the sum (A + B) is fed to the main FM transmitter 
channel while the difference (A —B) is fed to the sub-carrier gen- 
erator. Both are modulated in the main FM modulator, At the 
receiver, (A + B) appears at the output of the receiver discrimi- 
nator and (A-—B) at the output of a sub-earrier receiver. A 


phase inverter adds and subtracts these two signals and separates 
the original two channels. 


AMP, A 


system. Microphones A and B are mixed so that the sum 
combination (A + B) is fed to the main channel of the 
FM transmitter and the difference combination, (A —B) 
is fed to the sub-carrier channel. The sum combination 
is readily recognizable by its improved bass response since 
the difference combination results in a microphone directive 
array which tends to cancel lower frequencies. 

At the receiver the outputs of the main channel discrimi- 
nator and the sub-carrier adapter are fed to a mixing sys- 
tem utilizing a phase inverter such that one amplifier and 
speaker receives the combination, 

(A + B)-(A-B) = 2B 
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and the other amplifier and speaker 
(A+ B) + (A-B) =2A 

This addition and subtraction separates the individual 

microphone outputs from their sum-and-difference combi- 

nations just as though the A and B microphones fed the 

A and B speakers directly. 

It will be noted that the program on the main channel of 
the transmitter is the desired A + B combination of micro- 
phones which provides the compatible balanced program for 
the monaural listener. Since the sub-carrier channel is 
superaudible, the monaural listening is unimpaired by the 
stereo transmission. 


SIGNAL-TO-NOISE CHARACTERISTICS 


With the sum-and-difference technique, the resulting sig- 
nal fed to one loudspeaker is an addition of the signal from 
the main channel and that from the sub-carrier channel. 
This combination doubles the amplitude of the signal and 
adds the noises from the two channels by rms addition. 
The noise from the sub-carrier channel is so much greater 
than that on the main channel that the rms addition of 
the two noises results in a noise having an amplitude equal 
to that of the sub-carrier channel. Since the signals have 
been doubled and the noise is equal to that on the sub- 
carrier channel, the resulting signal-to-noise ratio fed to 
either loudspeaker input is equal to 6 db above the normal 
signal-to-noise ratio on the sub-carrier channel. 

In a previous publication,® data was presented on stereo 
recordings which indicated that the level of the difference 
combination (4 —B) was approximately 10 db below that 
of the sum combination (A + B). If such data were char- 
acteristic, an additional 10-db improvement would be re- 
alized with respect to the sub-carrier channel since the 
recombination at the receiver would call for a 10-db attenu- 
ation of the signal and noise from the sub-carrier channel. 
However, more recent measurements have shown that this 
early data was not characteristic and that the difference in 
level between the sum-and-difference combinations is negli- 
gible. The early data was taken from experimental tape 
recordings of rehearsals at Carnegie Hall by the late Harold 
T. Sherman. The conditions of recording were such that 
the distance between the orchestra and the microphones 
was unusually large. This results in a small phase differ- 
ence between the pickup from the two microphones so that 
the difference combination tends to cancel on a wide range 
of frequencies. With modern stereo microphone placement 
techniques, the microphones are placed so close to the or- 
chestra that the difference combination tends to cancel only 
the lowest frequencies and is therefore approximately the 
same level as the sum combination. Observations on a 
large number of present-day stereo tapes have confirmed 
this conclusion. 


3 Murray G. Crosby, Binaural Sound on One FM Channel, TV & 
Radio Engineering, Aug. and Sept. 1953. 


MURRAY G. CROSBY 


RECOMMENDED MODULATION STANDARDS 


The present FCC rules for an FM Subsidiary Commu- 
nications Authorization allow an application of 30 per cent 
of the available 75-kc deviation of the FM transmitter to 
be applied to the sub-carrier channel. Where the sub- 
carrier channel is to be used for directed background music 
or some other auxiliary service, this appears to be a prac- 
tical value to protect the quality on the main program. 
However, where the transmitter is devoted wholly to broad- 
casting a stereo program, a division of modulation with at 
least 50 per cent modulation on the sub-carrier channel 
would appear to be more practical. This division consider- 
ably reduces cross modulation and also improves the signal- 
to-noise ratio on the sub-carrier channel. Such an improve- 
ment in signal-to-noise ratio is realized as an improvement 
in range and performance of the stereo broadcast since the 
sub-carrier signal-to-noise ratio dominates the system. The 
increase in loss on the main program from 3 db to 6 db 
would be scarcely noticeable in view of the relatively large 
signal-to-noise ratio present on the main channel. 

The use of a single multiplex channel, with the complete 
transmitter devoted to stereo transmission only, is the op- 
timum transmission system for many reasons. The first 
advantage is that the multiplex receiving adapter is the sim- 
plest possible since there is no requirement of separating 
multiplex channels. The adapter filter then only has the 
requirement of rejecting the main channel and selecting the 
sub-carrier. The second advantage is that all of the avail- 
able modulation power for the sub-carrier may be concen- 
trated on one channel without the requirement of sharing 
it with another sub-carrier. In addition there is more 
available spectrum for applying a relatively high frequency 
deviation to the sub-carrier to thereby obtain the maximum 
efficiency of the stereo transmission system. The third 
advantage is that there is sufficient spectrum to transmit 
the full fidelity of 15-kc modulation range the same as is 
applied to the main channel. The fourth advantage is that 
cross modulation problems are considerably simplified. The 
transmission of a third channel on a Subsidiary Communi- 
cations Authorization increases the possible cross modula- 
tion situations from two to six. The required reduction in 
modulation level devoted to the sub-carriers, together with 
the lowered level of frequency deviation of the sub-carriers, 
further increases this problem. Receiver and transmitter 
maladjustments are then greatly emphasized since they may 
bring forth many possible combinations of cross modulation. 


Discussion 


Q. Has anyone estimated the cost of the receiver adapter 


which would handle the system? 


A. I would make an estimate, in low quantities, of around 


the $89.50 range. As the volume goes up, it should 
go below that. 
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In the sum-and-difference transmission a different type 
of distortion acts on each channel. The difference 
product goes through a sub-carrier which distorts it by 
processes unique to the sub-carrier. This is different 
from the processes which add distortion to the sum 
product which goes through the main channel. In the 
receiver, these products are added to and subtracted 
from each other. Since the distortions are not highly 
correlated, the resultant distortion comes from the most 
distorted channel. Have you any data on the overall 
channel result which you get with typical signals using 
this system with a suitably designed receiver, or do 
you have any special conditions that you would impose 
on the receiver? 

We found that we get more distortion on the main 
channel than we do on the sub-carrier, so that the re- 
sulting distortion in the stereo system would be that 
present in the main channel. 


Can you give me any idea of the accuracy of summa- 
tion and subtraction required in the transmitter and 
receiver ? 

That is concerned with the degree of cross modulation 
this system will stand. It is a balance, and I would 
say 20 db would be a normal value. 


As I understand it, most of the noise occurs on the 
A-—B channel, as you currently use the system. One 
loudspeaker receives the (A + B) + (A-—B), and the 
other (A+ B)-—(A-B). Therefore with regard to 
the noise, the two speakers are out of phase. However, 
I think you could add and subtract them in a slightly 
different manner which would make them in phase with 
regard to this noise signal. I was curious as to whether 
the noise would be more objectionable in the latter 
situation. 

I don’t think there would be a particle of difference. 
We have made that switch, where the phase was 
changed to try to get the monaural reception right in 
the center. There is a wrong and a right phase in 
this case, but the noise characteristics just weren’t 
different. 


What does A — B alone sound like? 
It is lacking in low frequencies. This is a directive 


pattern situation, such that the low frequencies cancel 
out, and it is fairly easy to determine by mere listen- 
ing. In the case of A + B, the low frequencies add. 


What sort of cross modulation do you get between the 
two channels at the output end? Suppose a different 
program were transmitted on each channel. 

With our sub-carrier system, the cross modulation is 
far better than 35 or 40 db at low frequencies, but 
gets poorer at the high frequencies. It is a matter 
of expense in filter design. Since stereo will stand just 
so much cross modulation, it becomes a matter of 
making the decision: Should we overdo the filter and 
make the receiver too expensive, or should we make a 
more practical compromise? 


Has the F.C.C. taken any position, or indicated serious 
interest, in this as a possible adoption by the broadcast 
field? 

No, but they know about it. I submitted this system 
when they had their docket out on multiplex trans- 
mission, and it has been in their files for some time. 
I do know that in allowing stations to put stereo on 
the air, they have cautioned about impairing one pro- 
gram with respect to the other, such as putting highs 
on one channel and the lows on the other. The sum- 
and-difference technique is compatible with their atti- 
tude since the reception of the monaural listener would 
not be impaired ii the sum were fed into the main 
channel. 
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An Artificial Stereophonic Effect 
Obtained from a Single Audio Signal” 


MANFRED R. SCHROEDER 
Bell Telephone Laboratories, Inc., Murray Hill, New Jersey 


In 1954, H. Lauridsen reported an experiment which gave a remarkable stereophonic effect 
using a single input signal. The signal was fed to both ears once directly in phase, and a second 
time (delayed by 50 to 150 msec) in anti-phase. This experiment is recognized as resulting in two 
different intensity vs frequency responses in the paths to the two ears, namely, two interlaced comb 
filters. The presence of such different responses, rather than the time delay, is postulated, from the 
results of the experiments described here, as the essential prerequisite that stereophonic effects 
be obtained from a single input signal. 


INTRODUCTION 


- 1954, H. Lauridsen of the Danish National Broadcast- 

ing System discovered a remarkable stereophonic effect 
using a single audio signal. The subjective impression is 
that of sound coming from all directions resulting in a 
unique spatial illusion and an enhanced sensation of pres- 
ence and immediacy for the listener. 

In order to obtain this effect the signal is applied to both 
ears in phase and, in addition, a delayed (50-150 ms) ver- 
sion of the same signal is fed to both ears in anti-phase. 
This experimental condition can be described in terms of 
two amplitude responses and a differential phase response 
for the paths to the two ears. The amplitude responses 
are two complementary comb-filters. The differential phase 
response is a meander-type function jumping discontinu- 
ously between -7/2 and -+-7/2 at frequencies which are in- 
teger multiples of 1/27, where + is the delay. 

A number of experiments have been performed involving 
complementary comb-filters of different kinds and various 
types of differential all-pass filters in order to distinguish 
between the relative contributions of the intensity varia- 
tions and phase variations, respectively. 

The result of these experiments is that “wiggly” differ- 
ential phase responses alone are not sufficient to give rise 
to the baffling spatial illusion first reported by Lauridsen. 
Rather, strong spectral intensity modulations, preferably of 
a complementary type, are required. These findings and 
the fact that the original experiment can be successfully 
duplicated with delays as short as 2.5 ms suggest that the 
delay is not the primary agent of this effect. Instead, the 


* Original manuscript received October 3, 1957. Revised manu- 
script received April 1, 1958. Paper delivered at the Ninth Annual 


Convention of the Audio Engineering Society, New York, October 
11, 1957. 
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simple fact that some frequencies enter the head through 
one ear while others prefer the other ear as their path is 
offered as an explanation for the psychoacoustic illusion of 
“sitting in the middle of the orchestra” (if orchestral music 
is played). 


Background 


The “pseudo-stereophonic” effect was obtained by H. 
Lauridsen' in a manner sketched in Fig. 1. By pseudo- 
stereophonic we designate a situation in which a spatial 
subjective impression is created by applying to the two ears 
two different versions of a single audio signal. The spatial 
effect created in this manner can never be a simple and 
reversible mapping of the real space from which the signal 
originated into the subjective space. For true stereophony 
one needs at least two independent signals. 

Figure 1 shows Lauridsen’s setup. The signal from a 
single microphone is applied to one loudspeaker directly 
and to another one, at right angles with the first one, de- 
layed by 50 to 150 ms. If the observer P is located on 
the center line the interferences at his two ears will be of 
opposite sign. For certain frequencies the contributions 
from the two loudspeakers will add at one ear and subtract 
at the other ear, while for a different frequency nearby, 
depending upon the delay, the effect is reversed. With no 
interference from echoes, and with equal amplitudes at P 
of the “point source” speaker f(t-r), and the “dipole 
source” speaker f(t), the signals at the two ears of P will 
be f(t-r) + f(t) and f(t-r) —f(t), respectively. 

Figure 2 illustrates another of Lauridsen’s expériments 
where the adding and subtracting is done electrically, not 
in the air as in Fig. 1. The complex frequency responses in 


1H. Lauridsen, “Nogle forség med forskellige former rum akustik 
gengivelse,” Ingenigren, No. 47, 906 (1954). 
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f F(t-7) +F(t) a 
wot f (t-7) . ®;(o) = -or/2 -2¥ o(or —2n2r-7), (8) 
— > n=0 
F(t-7)-F (t) @2(0) =-wr/2 + 9/2-25 o(ur-2ne). (9) 
n=0 
DELAY 


AMPLIFIER | 


Fic. 1. Lauridsen’s binaural experiment with two loudspeakers 
at right angles to each other. (Actually, Lauridsen delayed the 
signal to the left speaker.) 


the paths to the two ears are independent of the geometric 
and room-acoustical restrictions of the setup sketched in 


Fig. 1, 
S;() = exp [ior] + 1 = 2 cos(wr/2) exp [-iwr/2], (1) 
S2(w) = exp [-ior] — 1 = -2 isin(wr/2) exp[-—iwr/2]. (2) 


These two relations can be rewritten in terms of positive 
amplitude responses and phase responses (for positive fre- 
quencies) 


Si(w) = 2 | cos(wr/2) | (3) 
exp [-» i( or + Ne (wr — x — 2nr) ) | P 

S2() = 2 | sin(wr/2) | 7 (4) 
exp [-» i( otr—a+2n Ye (wr — 2nx) ) | ‘ 


Here o(x) = 0 for x <O and o(x) = 1 forx =O. 
We have thus two sinusoidal intensity responses, 


|S, (w) |? = 4 cos? (wr/2) (5) 

| So(w) P=ser (wr/2) (6) 
with . 

| Si(m) |? + | So(w) |? = 4. (7) 


That is, the sum of the intensities at the two ears is constant 
and independent of frequency. The two complementary in- 
tensity responses are plotted in Fig. 3. 

In addition to the frequency responses we have two dif- 
ferent phase responses. 


F (t-7) +f (t) 


pw DELAY T a, 


F(t -7)-F(e) 


Fic. 2. Lauridsen’s binaural experiment with earphones. 


In this report we will be little concerned with absolute 
phase. The phase difference between the two ears is the im- 
portant quantity. From (8) and (9) we obtain 

A®(w) = 9; — Py — 


—7r/2+ > [o(wr — 2nr) — o(wr — 2nx - x) | 
n=0 


or 
A®(o) =—2/2 + wr o(sin or). (10) 

This differential phase response is plotted in Fig. 4. 

We have thus two distinct effects: 

(1) Two sinusoidally varying intensity vs. frequency 

responses. 

(2) A meander-type differential phase response. 

Which of these two effects is more important in obtaining 
a pseudo-stereophonic effect? A number of experiments are 
described which were designed to differentiate between these 
two influences. 
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Fig. 3. Two complementary comb filters obtained by means of 
a delay line. 


Comb-Filter Experiments 


There are ways different from the one used by Lauridsen 
to “split” the frequency domain between the two ears in 
comb-filter fashion. In Fig. 5 for example, we have a bank 
of contiguous band-pass filters such as is used in sound 
analyzing equipment. If we connect all odd-numbered 
filters together and feed the sum to one ear and the outputs 
of all even-numbered filters to the other ear, we have again 
two complementary comb-filters with responses similar to 
the ones shown in Fig. 3. An experiment with this arrange- 
ment was performed using 16 filters, each being about 200 
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Fic. 4. Differential phase characteristic obtained by means of a 
delay line. 


cps wide. As one would expect from the close analogy 
with the original setup of Fig. 2, the pseudostereophonic 
effect is just as striking as in Lauridsen’s experiment. Of 
course, even the differential phase responses are similar. 
Only the phase variations at the crossover frequencies may 
be greater than x depending on the complexity of the filters 
used. 

However, there is one important difference: a filter width 
of 200 cps corresponds to a delay of only 2.5 ms in the 
circuit of Fig. 2. In contrast, the delays used by Lauridsen 
were between 50 and 150 ms. Here we obtain the same 
result with an equivalent delay of 1/10 or less. This ob- 
servation suggests that the comb-filter-type frequency split- 
ting between the two ears is the major cause of the strange 
effect observed first by Lauridsen. The fact that certain 
frequencies enter the auditory senses predominantly through 
one ear and other frequencies predominantly through the 
other ear seems a seductively simple explanation. It would 
also explain why the effect is so strong at very low and very 
high frequencies for which an explanation in terms of delay 
is impossible.” 


J 
3000 -3320 


Fic. 5. Two complementary comb filters obtained by splitting 
a bank of contiguous band-pass filters. 


2G. R. Schodder, “Vortaeuschungen eines akustischen Raumein- 
drucks,” Akust. Beihefte, Acustica, 6, 482 (1956). See also R. L. 
Hanson and W. E. Kock, “Interesting Effect Produced by Two Loud- 


speakers Under Free Space Conditions,” J. Acoust Soc. Am., 29, 145 
(1957). 
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Experiments with Phase-Linear Comb-Filters and 
All-Pass Filters 

In order to shed more light on the possible causes of the 
pseudo-stereophonic effect, a number of experiments in- 
volving phase-linear comb-filters and all-pass filters were 
conducted. The idea was to separate the amplitude effect 
shown in Fig. 3 from the phase effect shown in Fig. 4 and 
to assess their relative contributions. The result of these 
experiments is that the intensity variations are more im- 
portant than the phase variations. All experiments with 
all-pass filters gave only weak spatial illusions compared to 
the strong effect obtained with either Lauridsen’s setup or 
other complementary comb-filters. In contrast, using a pair 
of phase-linear comb-filters described below resulted in a 


strong spatial effect even though the delay is the same for 
both ears. 


F(t) +f @-27) 


+ + 
= DELAY DELAY 
sf T 
F(t) +2F (t-7)+F(t-27) 
2f (-7) 
F(t)+Ff t-27) A ~F() +2F (t-1) -f (t-27) 


Fig. 6. A pair of phase-linear comb filters. 


Figure 6 shows the circuit used to realize a pair of phase- 
linear comb-filters. The complex frequency responses are 


1 + 2 exp [-iwr] -+ exp [-2ie7 ] = (11) 
4 cos* (wr/2) * exp [-iwr], 
—1 + 2 exp [-iwr] — exp [—2ier | = (12) 


4 sin* (7/2) + exp [-iwr]. 
Since both cos? (wr/2) and sin? (wr/2) are nowhere negative 
the phase angles are in both cases -wr radians. Thus there 
is no phase difference between the signals at the two ears. 
But the amplitude responses are similar to those in the 
original experiment. In fact, they are identical to the in- 
tensity responses shown in Fig. 3. 

As expected the effect was as pronounced, if not more so, 
as in Lauridsen’s case. The reason why the effect could be 
stronger may be found in the fact that the amplitude re- 
sponses have less overlap than in the original experiment.* 


3A possible measure of overlap of two filters is the following ratio 


Ff | Silo) | * | S2(@) | do 
[f | Si(w) |2do* f | S2(w) 2 do]* 


In the original experiment R — 0.636. Here R = 0.333, indicating less 


overlap of the two frequency responses. 
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AN ARTIFICIAL STEREOPHONIC EFFECT OBTAINED FROM A SINGLE AUDIO SIGNAL 


Fie. 7. Differential phase Ap = + 2/2. 

While the experiments with a pair of phase-linear comb- 
filters prove that a spatial illusion can be obtained without 
a phase difference of the signals at the two ears, the ques- 
tion remains unanswered whether different phases in the 
absence of spectral intensity modulations are capable of pro- 
ducing the spatial illusion. To this end, a number of ex- 
periments with all-pass filters have been performed. Figure 
7 shows the circuit used to duplicate the differential phase 
characteristic plotted in Fig. 4 without appreciable inten- 
sity fluctuations. It consists of a bank of comb-filters and 
simple R-C circuits as phase shifters. The differential 
phase at the outputs is + 7/2. Figure 8 shows a similar 
circuit with differential phase +7. The arrangements 
shown in Fig. 9 and Fig. 10 vary in their input-output 
phase relations but have the same differential phase at their 
outputs. Experiments with any of the arrangements shown 
in Figs. 7-10 give rather similar effects. While they differ 
from a diotic presentation of a single signal, they do not 
result in a strong spatial effect. In fact, the perception is 


PHASE 
INVERTERS 


Fic. 8. Differential phase Ap = + z. 


SHIFTERS 
+1/2 


FILTERS 


-1/2 


-/2 


+m/2 


+17/2 
Fic. 9. Differential phase Ap = + x. 


quite comparable to that obtained from an inversion of one 
of two identical signals. The simple circuit for this condi- 
tion is pictured in Fig. lla. It should be noted that this 
connection results in a constant differential phase in con- 
trast to the oscillatory phase characteristics of the previous 
circuits. Another filter having a constant differential phase 
at its outputs and giving similarly weak results is sketched 
in Fig. 11b. It is the well known 7/2 all-pass filter. 

For completeness another simple phase filter should be 
mentioned; a delay in the path to one ear shown in Fig. 1 1c. 
While this is another case of an all-pass filter, it lacks the 
symmetry of the previously described arrangements. One 
ear receives the signal earlier than the other. If this time 
difference is great enough, it is noticeable and gives the 
impression of the sound coming from one side, or, for even 
greater delays, the signal and “echo” are heard separately. 

BANO 


PASS 
FILTERS 


PHASE 
INVERTERS 


Fig. 10. Differential phase Ap = + x. 
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Fic. 11. Three all-pass filters with various differential phase 
characteristics. 


However, no real three-dimensional impression is conveyed. 
As a curiosity it may be mentioned that a loudspeaker 
and two microphones in a reverberant room can be used to 
realize a filter with two different outputs. This situation 
is depicted in Fig. 12. If the distances between the two 
microphones and the microphones and the loudspeaker are 
large enough, the two outputs will be uncorrelated. Listen- 
ing over a pair of earphones connected to the outputs of this 
“mechanical” filter results in the familiar spatial illusion. 


Conclusions 


Lauridsen’s binaural experiment was duplicated using a 
bank of contiguous band-pass filters. While it is not sur- 
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Fic. 12. Reverberant room used as a ‘‘mechanical’’ filter with 
one input and two outputs. 
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prising that comb-filtering realized by band-pass filters 
should give the same effect as comb-filtering realized by the 
use of a delay line, because of the similar amplitude and 
phase characteristics in both cases, a significant finding 
results from the experiments described here: for the delays 
used by Lauridsen (around 100 ms) the bandwidth of an 
individual “tooth” in the comb-filters is approximately 5 
cps. The band-widths used here are around 200 cps cor- 
responding to an equivalent delay of 2.5 ms. This result 
makes an explanation of the pseudostereophonic effect by 
analogy to room-acoustic delays (between direct sound and 
echoes) look farfetched. Rather, the simple fact that some 
frequencies enter the auditory center through one ear while 
other frequencies travel by way of the other ear may well 
be the real cause of the extraordinary effect discovered by 
Lauridsen. 

A variety of experiments with all-pass filters are inter- 
preted as supporting this view. In these cases there are no 
significant intensity differences between the two ears. In 
one experiment (Fig. 4) the differential phase response is 
very nearly identical to the one obtained in the original 
experiment with earphones. But the strong pseudo-stereo- 
phonic effect is missing in all experiments not involving 
strong spectral intensity modulations. This establishes dif- 
ferent amplitude responses as a necessary condition for the 
binaural effect reported by Lauridsen. 

A pair of comb-filters was constructed that have identical 
linear phase responses. The successful experiments with 
this filter pair eliminate inter-aural phase differences as a 
necessary condition for auditory spatial illusions. 

The technical assistance of Mr. H. W. Hines in this study 
is gratefully acknowledged. 


Discussion 


Q. (Paul Conklin, Baldwin Piano Co.): The fact that in 
the original experiment, as you have it set up here, the 
second speaker is unbaffled suggests that low frequency 
response is not essential in order to produce this effect. 
I was wondering if you have made any controlled ex- 
periments with comb-filters by limiting the frequency 
response to the upper frequency region to see to what 
extent frequency response affects this effect. 


A. Ideally, of course, the two frequency responses apart 


from the comb-filter effect should be the same. In this 
particular experiment the intensity you get from this 
upper speaker and the intensity that you get from the 
lower should be the same. This is true only for a lim- 
ited range in this room; the frequency characteristics 
are also different and coincide only in a limited range, 
say from 200 or 300 cps to several kc. Even that may 
be an optimistic statement. Here in this loudspeaker 
demonstration, you don’t have the clearcut relations 
which you have in an earphone demonstration. 
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If, in the earphone demonstration, you were to delib- 
erately restrict the frequency range over which these 
amplitude fluctuations occur, what effect would this 
have on the pseudo-stereophonic impressions? Could 
you restrict the effect to high frequencies alone, say, 
above 2 or 3 kc? 


No. The point in this effect is that you have two comb- 
filters over the whole frequency range which you actu- 
ally want to present. You can limit your signal at the 
high and the low end, but you get a poorer signal, not 
as “hi-fi.” Actually the signal we have on the ear- 
phones, and also on the speakers, is limited to around 
4 kc, because the delay lines we used worked only up 
to 4 kc. If you make the band even narrower, the 
signal sounds poorer, but you have the effect still. 


I just wanted to make a comment. I was very sur- 


prised to observe the apparent bass response of the little 
unbaffled speaker; surprised because it should be very 
inefficient. 
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Stereophonic Sound Reproduction in the Home” 


APRIL 1958, VOLUME 6, NUMBER 2 


Harry F. Otsont 


RCA Laboratories, Princeton, N. J. 


Realistic sound reproduction in the home implies solution of those problems which accompany 


spatial compression. The author draws upon his wide experience in this subject to describe the 
procedure and results of the psychoacoustical experiments designed to isolate the factors which 
contribute realism to sound reproduction in home-size listening rooms. He concludes that while 
true concert-hall acoustics cannot be simulated, the mechanical and electronic means to achiéve 
a reasonable facsimile are available, and describes such a system for the playback of stereophonic 


magnetic tapes. 


INTRODUCTION 


ACHIEVE realism in a sound reproducing system 
three fundamental conditions must be satisfied, namely: 
first, the frequency range must be such as to include all 
the audible components of the various sounds to be repro- 
duced; second, the volume range must be such as to permit 
noiseless and distortionless reproduction of the entire range 
of intensity associated with the sounds; third, the reverbera- 
tion and spatial characteristic of the original sound must be 
preserved. During the past decade many experiments and 
tests have been conducted to determine the factors which 
play the major roles in establishing realism in reproduced 
sound under practical operating conditions in the home. 
It is the purpose of this paper to describe some of the 
fundamental experiments that have been carried out which 
have formed the basis for the systems and techniques which 
have been developed to pick up and record sound by stereo- 
phonic means in the studio and to reproduce the recorded 
sound by stereophonic means in the home. 


ACOUSTIC FREQUENCY PREFERENCE TEST* 


In order to establish the validity of the first condition, 
namely, that the frequency range must be such as to include 
all the frequency components in the audio range, a funda- 
mental all-acoustic frequency preference test was conducted. 
The general arrangement of the test is shown in Fig. 1. An 
acoustical filter is placed between the orchestra and the 
listeners and is arranged so that it can be turned in or out 
of operation, as shown in Fig. 1. The response frequency 
characteristic of the acoustic filter approximates commer- 


* Manuscript received October 7, 1957. Presented at the Ninth 
Annual Convention of the Audio Engineering Society, New York, 
October 11, 1957. 

t Director, Acoustical and Electromechanical Research Laboratory. 

1 The all-acoustic frequency experiments were performed in 1946. 
A complete description of these experiments is reported by H. F. 
Olson, J. Acoust. Soc. Amer., 19, 549 (1947). 
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cial radio or phonograph reproduction in the high frequency 
range. The acoustical filter is composed of ten units with 
each pivoted at the top and bottom. The ten units are 
coupled together and rotated by means of a lever. In this 
way, the acoustical filter can be placed in or out of opera- 
tion by merely turning the units through 90°. The acous- 
tical filters are shown in the full frequency range position 
in the plan view of Fig. 1. A sheer cloth curtain which 
transmits sound with no appreciable attenuation up to 10 
kc, and less than 2-db attenuation in the range from 10 kc 
to 15 ke is placed between the acoustical filter and the 
listeners. The curtain is illuminated so that the listeners 
cannot see what transpires behind the curtain. The par- 
ticular condition, that is, the full frequency range or 5-kc 
low-pass transmission, is shown on the A-B indicator. 

The tests were conducted in a small room as shown in 
Fig. 1. This room simulates the average living room in 
dimensions and acoustics. The orchestra was a six-piece 
dance band playing popular music. The average sound 
level in the room was 70 db. The changes from wide open 
to low pass to wide open, etc., were made every 30 seconds. 
Two selections of popular music were played and the lis- 
teners were asked to indicate a preference. The results of 
the tests have indicated a preference for the full frequency 
range. Similar tests have been made for speech. The 
preference for speech is also for the full frequency range. 
There is a distinct lack of presence in speech with limited 
frequency range. 

The all-acoustic frequency preference test establishes the 
validity of the first condition, namely, that the frequency 
range must include all frequency components in the audio 
range. 


NONLINEAR DISTORTION TEST? 


In an ideal sound reproducing system the elements are 
invariant with respect to time. However, in a practical 
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Fie. 1, Plan and elevation views of the apparatus for the acoustic frequency preference 
testing of speech and music, including a sectional view, acoustical network, and the response 
frequency characteristic of the acoustical filter and the results for speech and music. 


system all the elements are nonlinear. These nonlinear 
elements introduce nonlinear distortion. Some idea of the 
effect of nonlinear distortion can be obtained from a study 
of the masking curves* of the ear. From these data it will 
be seen that the higher-order harmonics are noticeable at 
a much lower level than the lower-order harmonics. Fur- 
thermore, as the high-frequency range is increased the effects 
of the harmonics are more noticeable. In complex waves 
of speech and music, sum and difference tones are also an 
important phase of the problem of nonlinear distortion. 
The subjective effect of the various types of nonlinear 
distortion of speech and music was determined by means 
of the system shown in Fig. 2. The overall response fre- 
quency characteristic of the microphone, amplifier, and 
loudspeaker was uniform to within 2 db from 45 to 15 000 


2The nonlinear distortion tests were performed in 1946. A de- 
scription of the tests was first given in H. F. Olson, Elements of 
Acoustical Engineering, (D. Van Nostrand Company, Princeton, 
N. J., 1947). 

3H. F. Olson, Acoustical Engineering, (D. Van Nostrand Company, 
Princeton, N. J., 1957). 
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cps (Fig. 3). Low pass filters were provided with cutoff 
frequencies at 3 750, 5000, 7 500, and 10000 cps. The 
nonlinear distortion in the overall channel was less than 
0.25%. In addition, the distortion components were prin- 
cipally second and third harmonics. The pickup studio for 
these tests was the free-field room. The sound was repro- 
duced in a room with dimensions and acoustics similar to 
that of the average living room. The noise level at the 
pickup point was from 0 to 10 db. The noise level in the 
listening room was about 25 db. 

Two types of distorting amplifiers were used, a single- 
ended triode and a single-ended pentode. The distortion 
components for these two power output systems for various 
power levels are shown in Fig. 4. 

These tests were limited to three subjective gradations 
of nonlinear distortion: perceptible, tolerable, and objec- 
tionable. Perceptible is the amount of distortion in the 
distorting system required to be just discernible when 
compared to the reference system. Tolerable and objec- 
tionable are not as definite and are a matter of opinion. 
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Fic. 2. Schematic arrangement of the apparatus for a subjective determination of the rela- 
tion between nonlinear distortion and frequency range preference in reproduced sound. 
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Fic. 3. Experimental results of subjective tests of reproduced 
speech and musie depicting objectionable, tolerable and perceptible 
nonlinear distortion for various high frequency cutoffs. The num- 
bers in the distortion data correspond to the numbers which label 
the response frequency characteristic. The distortion in per cent is 
the ratio of the total RMS of the second, third, fourth, ete., compo- 
nents of distortion to the RMS of the fundamental, multiplied by 
one hundred. 


By tolerable distortion is meant the amount of distortion 
which could be allowed in medium grade commercial sound 
reproduction. By objectionable distortion is meant the 
amount of distortion which would be definitely unsatisfac- 
tory for the reproduction of sound in phonograph and radio 
reproduction. 

Both speech and music were used in making these tests. 
In the case of music a six-piece orchestra was employed. 

The average results of a few of these tests, with a limited 
number of critical observers, are shown in Fig. 3. As would 
be expected from the frequency ranges of speech and music 
together with the masking curves, a distorting system with 
high-order harmonic components is more objectionable than 
one with low-order components. The amount of tolerable 
distortion is greater for speech than for music. 

The objective and subjective tests on the determination 
of objectionable, tolerable and perceptible nonlinear dis- 
tortion establishes the following facts: (1) Sound reproduc- 
ing systems with no perceptible nonlinear distortion can be 
designed and built provided considerable care is taken in 
the design and construction of the equipment, (2) Sound 
reproducing systems with tolerable nonlinear distortion can 
be designed and built employing conventional techniques 
in widespread commercial use today. 


. OLSON 


STEREOPHONIC FREQUENCY PREFERENCE TEST‘ 


Two fundamental facts have been established in the pre- 
ceding section, namely, the frequency range preference for 
live speech and music and the perceptible, tolerable and 
objectionable distortion in a sound reproducing system. 

The next step is reproduction of the sound by stereo- 
phonic means with a system having low nonlinear distortion. 
Specifically, the problem is to “transfer” the reproduced 
orchestra to the position of the live orchestra by means of 
microphones, amplifiers and loudspeakers. The first step 
then becomes the elimination of the acoustics, i.e., the re- 
verberation characteristics, of the pickup studio. This was 
accomplished by the use of a large free-field or anechoic 
room for the studio. 

The floor plans of the free-field room used as a studio 
and of the listening room and the general arrangement of 
the tests are shown in Fig. 5. The idea of the test was to 
“transfer” the orchestra to the listening room by means of 
microphones, amplifiers, and loudspeakers. In order to 
simulate the all-acoustic tests in this transfer of the orches- 
tra, it is obviously necessary that the studio be devoid of 
acoustics, that is, reverberation. To obtain these condi- 
tions the free-field room was used as the studio. With the 
orchestra playing in the free-field room, the level of the 
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Fig. 4. Distortion characteristics of single-ended pentode and 
triode amplifiers. 


4The stereophonic frequency preference experiments were per- 
formed in 1948. A description of these experiments is given in H. F. 
Olson, Acoustical Engineering, (D. Van Nostrand Company, Prince- 
ton, N. J., 1957). 
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Fie. 5. A sehematie plan view of free-field room, the listening 
room, and the apparatus for the frequency range preference testing 
of stereophonic reproduced speech and music. 


reflected sound at the microphone for steady state sound 
conditions is about 50 db below the direct sound. There- 
fore, it is impossible to detect any acoustics of the studio 
in the reproduced sound. 

The listening room is the same as that used in the all- 
acoustic test, and, as noted before, was designed to be the 
acoustical equivalent of an average living room. 

The reproducing system used in these tests employed 
two channels. Each channel consisted of an RCA 44BX 
velocity microphone, an RCA OP-6 amplifier, a laboratory- 
developed triode push-pull power amplifier, laboratory- 
developed high- and low-pass electrical filters, and an RCA 
LC1A loudspeaker. 

The overall response frequency characteristics depicting 
the ratio of the sound pressure output from the loudspeaker 
in free space to the sound pressure at the microphone in free 
space, with and without the electrical filters, are shown in 
Fig. 6. In the restricted range condition there is attenuation 
in both the low- and high-frequency ranges. The product of 
the low- and high-cutoff frequency is 500,000 (cycles)*. 
This value has been recommended by many investigators.® 


5 Hanson, Rackey and Nixon, Down to Earth on High Fidelity, 
Eng. Dept., NBC, New York, N. Y., March 27, 1944. 


In this test there is a deviation from the original all-acoustic 
frequency preference test in which a high-frequency cutoff 
alone was used. An argument in favor of the combination 
of high- and low-frequency cutoffs is that it approximates 
conventional radio and phonograph response frequency 
characteristics. 

The directivity pattern of the loudspeakers is very im- 
portant where the listeners are located at relatively large 
angles with respect to the loudspeaker. In the loudspeakers 
used in these tests the variation in response at any fre- 
quency over a total angle of 90° is less than + 2 db. 

The nonlinear distortion is another important factor in 
reproduced sound. The overall nonlinear distortion was 
measured by supplying a distortionless signal to the input 
of the chain consisting of the voltage amplifier, power 
amplifier, and loudspeaker. The sound output of the loud- 
speaker was picked up by the microphone and fed to a 
harmonic analyzer. This method of measurement provides 
an overall distortion characteristic from sound input to the 
microphene to sound output of the loudspeaker The total 
nonlinear distortion measured at the peak level of the re- 
produced sound was less than 0.3 per cent. From the results 
reported in the preceding section, it will be seen that this 
value of nonlinear distortion is sufficiently low to be prac- 
tically imperceptible. 

The level of the reproduced sound in the listening room 
is important in any subjective test. Tests have shown that 
a peak level of about 70 db to 80 db is most pleasing for 
serious listening in a small room. The average peak sound 
intensity level on a standard level indicator was 75 db. 

The same six-piece band was used in these tests as in 
the case of the all-acoustic frequency range tests. The 
change from full frequency range to restricted frequency 
range was made every 30 seconds. The results of these 
tests indicate a preference for the full frequency range. 
Similar tests were made for speech, with the same result. 

The stereophonic frequency preference tests demonstrate 
that a reproducing system with a low order of distortion 
and stereophonic sound reproduction yields the same re- 
sults as the all-acoustic frequency preference test. 
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Fig. 6. Over-all response frequency characteristic of the two fre- 
queney range conditions used in the frequency range preference 
tests for stereophonie reproduced speech and music, A. Wide fre- 
quency range. B. Restricted frequency range. 
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SCHEMATIC DIAGRAM OF EXPERIMENT 
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Fig. 7. Schematic arrangement of the apparatus which illustrates 
the effeet of phase delay and amplitude upon the localization of a 
sound source. The graph shows the ratio of the voltages E, and E, 
as a function of the delay for the condition when it is impossible to 
distinguish which loudspeaker is the source. 


NOISE TESTS 


Noise usually determines the lower limit of reproduction 
in a sound reproducing system. In the sound reproducing 
system used in the “Nonlinear Distortion Test” and the 
“Stereophonic Frequency Preference Test” there are two 
separate noise sources, namely, the ambient noise level in 
the free-field room and the listening room. The noise level 
in the free-field room was 0 to 10 db. The noise level in 
the listening room was 25 db. The noise level in the sound 
reproducing system as reproduced in the listening room 
was less than 10 db. Under these conditions the listening 
room was the limiting factor on noise. In this connection, 
it should be pointed out that the listening room was a rela- 
tively quiet room in view of the fact that the noise level® 
in the average residence is 42 db. 

The low order of noise in the reproducing system con- 
sisting of a microphone, amplifier, and loudspeaker used in 
these experiments cannot be achieved when a record is used 
for the reproduction of the sound. However, subjective 
tests of high grade commercial disk and magnetic tape 
records and reproducers have shown that the noise in these 
reproducing systems is of no practical consequence when 


6D. F. Seacord, J. Acoust. Soc. Am., 12, 183 (1940). 
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reproduced sound source. 


the sound is reproduced under the noise level conditions 
of the average residence. 


LOCALIZATION TESTS’ 


From the standpoint of the listener, in any true stereo- 
phonic sound reproducing system there should be a sub- 
jective effect of a distribution of the reproduced sound 
sources in lateral directions as well as in depth. It is the 
purpose of this section to describe a series of experiments 
that demonstrate stereophonic effects in sound reproduced 
by means of a two-channel system. 


A. Lateral Localization 


There are two factors that determine angular or lateral 
localization of a source of sound, namely, phase and inten- 
sity. An experiment which demonstrates angular or lateral 
localization with regard to phase and intensity is shown 
in Fig. 7. The same signal is reproduced from loudspeakers 
1 and 2. The signal from loudspeaker 1 can be delayed 
by means of the delay system. For each value of delay, 
the ratio of the voltage input to the two loudspeakers is 
varied until it is impossible to distinguish which loudspeaker 
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Fig. 8. Schematic arrangement of the apparatus which illus- 
trates the effect of the relative delay and amplitude of two spaced 
sound sources in determining the apparent lateral location of the 


7 The localization tests were performed over a pericd of time ex- 
tending from 1949 to 1955. 
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Fic. 9. Schematic arrangement of the apparatus of a two- 
channel stereophonie system showing the locations of the sound 
sources in the free-field room and the apparent sound sources as 
reproduced in the listening room. 


appears to be the source. The results of this test are shown 
by the graph of Fig. 7. This experiment shows that there 
can be considerable unbalance in amplitude before the 
sound ceases to appear to come from the undelayed source, 
and therefore, that both phase and intensity play a part 
in angular localization. 

The experiment of Fig. 7 was carried further to deter- 
mine the exact subjective location of the sound. 

In performing the lateral localization experiments, some 
precautions must be taken to avoid errors in observation. 
For example, if the observer can see the loudspeakers he 
tends to point to either one or the other as a source. This 
is a natural state of affairs because there are two visible 
sources of sound, and, obviously, the sound must come from 
one or the other. In order to avoid this subjective bias, the 
loudspeakers were located behind a light-opaque, sound- 
translucent screen. The arrangement* is shown in Fig. 8. 

When the intensity and the phase of the sound emanating 
from the two loudspeakers is the same, the sound appears 
to originate from a point midway between the two loud- 
speakers, designated as S;’. If the intensity of the sound 


emanating from both loudspeakers is the same but the 
phase of loudspeaker A’ of Fig. 8 is delayed 2 msec with 
respect to loudspeaker B’, the sound appears to come from 
the point S,’ of Fig. 8. If the delay is increased to 5 msec, 
the sound appears to come from point S;’ of Fig. 8. If the 
intensity of A’ is made 5 db higher than B’, and A’ is de- 
layed 5 msec with respect to B’, the sound appears to come 
from point S,’. If the intensity of B’ is made 5 db higher 
than A’ but the phase of the two is the same, the sound 
appears to come from point S;’. If the intensity of B’ is 
made 2 db higher than A’, but the phase is the same, the 
sound appears to come from point S,’. If the intensity of 
B’ is 5 db higher than A’ and the sound from loudspeaker 
A’ is delayed about 5 msec, the sound appears to come from 
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Fic. 10. Schematic arrangement of the apparatus of a two- 
channel stereophonie system showing the arrangement of the 
sound sources in the free-field room which will give a uniform 
spacing of the sound sources at the curtain in the listening room, 


8In the experiments depicted in Figs. 8, 9, 10, and 11, the listener 
was asked to locate the apparent lateral location of the reproduced 
source of sound at the surface of the curtain. In other words, the 
apparent location of the source of sound with respect to depth was 
not determined. In the section on depth localization, experiments 
relating to the apparent location of the source of sound with respect 
to depth will be described. 
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Fig. 11. Schematic arrangement of the apparatus of a two- 
channel stereophonie system using the arrangement of the sound 
sources in the free-field room of Fig. 10 and showing the location 
of the apparent sound sources at the curtain in the listening room 
with the observer located 214 feet from the center line. 


point S.’._ This experiment shows that the apparent posi- 
tion of the reproduced sound can be shifted over wide 
limits by varying the relative phases and/or relative inten- 
sities of the sound sources. 

As an extension of the above experiments, the stereo- 
phonic arrangement of Fig. 9 was employed. The configu- 
ration of the microphones and loudspeakers is shown in 
Fig. 9. A person speaking was located at the different 
positions S,, S2, S;, Sy, and S; in the free-field room. The 
corresponding apparent locations of the reproduced sound 
in the listening room are shown as S,’, So’, S3’, Sy’, and S;’. 
The relative estimated sound levels in db are also shown, 
with positive S,’ arbitrarily designated as the 0-db reference 
level. This experiment shows that the apparent position of 
the reproduced sound follows the actual position of the 
sound in the studio with small deviations. The deviations 
are as follows: The reproduced sound tends to be spread out 
in middle positions. The overall spread of the sound sources 
is less than the distance between the loudspeakers. The 
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reproduced sound level appears to be lower in the middle 
positions. 

In the next experiment an attempt was made to reduce 
the spreading out and reduction of level of the reproduced 
sources in the middle positions. The position of the sources 
in the studio which was found to give practically equal 
spacing of the apparent sound sources in the listening room, 
and the orientation of the microphones required, are shown 
in Fig. 10. The experiment of Fig. 10 shows that it is 
possible to obtain an equal spread of the sounds in repro- 
duction. Furthermore, there is only a slight reduction in 
level for the middle positions. 

In the preceding experiments the listener has been located 
on a line perpendicular to the midpoint of the line joining 
the two loudspeakers. With the studio sound source posi- 
tions the same as in Fig. 10, the listener was next located 
2% feet from the center line, as shown in Fig. 11. The 
location of the apparent sources then appeared as shown. 
The apparent sources are concentrated toward the loud- 
speaker nearest the observer. As would be expected the 
apparent intensity falls off with the distance from the loud- 
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Fie. 12. Schematic arrangement of the apparatus of a two- 
channel stereophonie system showing the arrangement of the 
sound sources in the free-field room and the apparent location and 
relative sound level of the sound sources in the listening room. 


; 86 a 
@;)+—souno SOURCES of é) 
6 Fg | 
; FREE | | ‘e . 
ROOM OF ° | 
P| ; 25 ‘e A SOURCES ft 
| ie ©———@+ 
a —/ 
es Ox T® 5 
a O+ + @01 

| Seg Pe 1 
¥ 
7 (G3) cose 


STEREOPHONIC SOUND REPRODUCTION IN THE HOME 87 


SOUND SOURCES ye) <j 
es 


STUDIO 


OK 
@) 
@) 


DIRECTIONAL PATTERN 


MICROPHONE 


AMPLIFIER 


ATTENUATOR 


AMPLIFIER 


VOLUME INDICATOR 
LOUDSPEAKER 


OS ) OBSERVER 


Fic. 13. Schematic arrangement of the apparatus of a two- 
channel stereophonic sound system showing the arrangement of 
the sound sources in the studio and the apparent location of the 
sound sources in the listening room. The reproduced level of the 
sound was adjusted so that the output was the same for all loca- 
tions of the reproduced sound. 


speaker nearest the observer. In spite of the fact that the 
apparent sources are not equally spaced or of constant in- 
tensity, from a practical standpoint, the stereophonic aspects 
are preserved. 


Sig wiv. | 


B. Depth Localization 


In any true stereophonic sound reproducing system, there 
should be a sense of relative depth as well as a sense of 
lateral distribution. However, in this connection, it is gen- 
erally recognized that the subjective effect of lateral dis- 
tribution is more important. Many factors contribute to 
the subjective sense of depth in stereophonic reproduction 
of sound. These -include differences in: time of arrival of 
the sound from two sources spaced in depth, intensity of 
the sources, response frequency characteristics of the sources, 
and reproduced reverberation of the sources. 

To determine the subjective effect of depth, an experi- 
ment was carried out as shown in Fig. 12. A person speak- 
ing was located at the positions S;, So, Ss, Sy, Ss, and S¢ in 


the free-field room. The corresponding apparent estimated 
locations of the reproduced sound in the listening room are 
shown as S;’, So’, Ss’, Ss’, Ss’, and Sg’. The relative esti- 
mated sound levels in decibels are also shown with positions 
S,’ designated arbitrarily as the 0-db reference level. It 
appears that in this experiment the apparent locations in 
depth are determined by intensity. 

In another experiment, the person speaking was located 
in the free-field room at position S, of Fig. 12. The ap- 
parent location of the reproduced sound could be moved 
from S$,’ in the listening room when a uniform overall re- 
sponse characteristic was used, to a position toward the 
listener when the response was accentuated in the frequency 
range from 1000 to 4000 cps, or to a position away from 
the listener when the response was reduced in the same 
frequency range. The frequency range from 1000 to 4000 
cps plays an important role in determining the presence® 
of the reproduced sound. When the response in this region 
is increased, the presence is increased, and the effect is to 
move the source closer to the listener. 

In another experiment, the person speaking was located 
in a room of 5500 cubic feet and reverberation’ time of 
0.7 second at positions S;, Ss, Ss, Sy, S;, and Sg of Fig. 13. 
In this experiment the level of the reproduced sound was 
adjusted so that the output was the same for all locations 
of the reproduced sound. The corresponding relative loca- 
tions of the reproduced sound in the listening room was 
Si’, So’, Sz’, Sy’, Ss’, and S,’. As the distance between the 
source of sound and the microphone is increased, the effec- 
tive reverberation of the source of sound is increased. As 
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Fig. 14. Schematic arrangement of the apparatus for converting 
a single-channel sound pickup into stereophonie sound reproduc- 
tion. 


9H. F. Olson, Musical Engineering, (McGraw-Hill Book Company, 
New York, N. Y., 1952). 

10 The reverberation time of the studio as a function of the fre- 
quency corresponds to that recommended in Olson, Acoustical Engi- 
neering, (D. Van Nostrand Company, Princeton, N. J., 1957). 
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Fig. 15. Schematic arrangement of the apparatus of a two- 
channel stereophonie system in which certain sound sources in a 
large group of sound sourees are accentuated in the stereophonic 
sound reproduction of the aggregation. 


the effective reverberation of the reproduced sound is in- 
creased, the presence is decreased, and the apparent location 
of the source of reproduced sound will move away from the 
listener. 

The systems and experiments depicted in Figs. 8, 9, 10, 
11, 12 and 13 demonstrate that practical stereophonic re- 
production of sound with the subjective effects of both 
lateral and depth distribution of the reproduced sound 
sources can be achieved by the use of a two-channel system. 


EXPEDIENTS USED IN STEREOPHONIC 
SOUND RECORDING 


It has been demonstrated in the preceding sections that 
the apparent location of a source of sound in stereophonic 
reproduction of sound is determined by the relative inten- 
sities and phases of the sounds emanating from the two 
loudspeakers. These parameters may be employed to con- 
vert a single-channel sound pickup to stereophonic sound 
reproduction. The arrangement of the apparatus for this 
is shown in Fig. 14. 

In certain musical aggregations it may be desirable to 
accentuate certain performers. This objective can be re- 
alized by the use of additional microphones as shown in 
Fig. 15. In the system of Fig. 15 the sound source S;, is to 
be emphasized. This could be accomplished by placing 
the source close to the microphone. This may be incon- 
venient. The same result can be obtained by the use of a 
second microphone in channel A placed near the source S;. 
In this way the relative levels of the source S, and the other 
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sources can be adjusted by means of the gain controls fol- 
lowing microphones M4; and M42. The sound source S2 
is another example of a source to be emphasized. However, 
in this case the output is channel B. The above expedients 
and arrangements can be extended to obtain the desired 
distribution of the apparent sources in the reproduced sound. 


REVERBERATION TESTS 


Room acoustics, as exemplified by the effective reverbera- 
tion of the reproduced sound, plays an important role in the 
establishment of realism. The effective reverberation’ is 
primarily a function of the ratio of the direct to reflected 
sound at the observation point. In the case of reproduced 
sound the effective reverberation of the sound picked up 
in the studio is the ratio of direct to reflected sound signal 
output of the microphone. The effective reverberation of 
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Fic. 16. Schematie diagram of a two-channel stereophonie mag- 
netic tape sound reproducer. 


11H. F. Olson, Acoustical Engineering, (D. Van Nostrand Company, 
Princeton, N. J., 1957). 
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Fig. 17. Perspective view of a two-channel magnetic tape stereo- 
phonic sound reproducer. 


LOUDSPEAKER 


the signal output of the microphone is primarily a function 
of the reverberation time of the studio, the distance be- 
tween sound source and the microphone, and the direc- 
tional characteristics of the microphone. The effective re- 
verberation of the reproducing system in the listening room 
is primarily a function of the reverberation time of the 
listening room, the distance between the loudspeaker and 
the listener and the directional characteristics of the loud- 
speaker. The effective reverberation of the overall system 
is the effective reverberation of the complete chain from 
the sound source in the studio to the listener in the listen- 
ing room. 

In listening to live sound the acoustics of a single room 
are involved. However, in the case of the reproduced sound 
the acoustics of two rooms are involved, namely, the room 
in which the sound is picked up and the room in which the 
sound is reproduced. If the room in which the sound is 
picked up is a free-field room as was the case of the “Stereo- 
phonic Frequency Preference Test,” then the only acoustics 
involved are those of the listening room. In effect, the 
sound sources are transferred to the listening room by means 
of a sound reproducing system. If the acoustics of the 
average living room in the home were suitable for all types 
of music, then the logical procedure would be to record all 
of the material in a free-field sound room. However, for 
most recorded material additional reverberation is required. 
The optimum reverberation of the reproduced sound is the 
sum of the effective reverberation of the studio and listen- 
ing room. In arriving at the correct value of reverberation 
of the reproduced sound in practice, it is assumed that the 
listening room will exhibit the acoustics of the average living 
room. Then the acoustics of the studio and sound pickup 
system are selected so that the optimum value of effective 
reverberation will be produced when the records are repro- 
duced in the average living room. 

In this connection, it should be pointed out for example, 
that true concert hall acoustics cannot be achieved in the 
sound reproduced in the living room of a home. Neverthe- 
less, a reasonable facsimile of the acoustics of the concert 


hall can be obtained in the living room by the proper selec- 
tion of the overall effective reverberation. 


RECAPITULATION 


As stated in the introduction, in order to achieve realism 
in a sound reproducing system the following conditions 
must be satisfied, namely: first, the frequency range must 
be such as to include all the audible components of the 
various sounds to be reproduced; second, the volume range 
must be such as to permit noiseless and distortionless repro- 
duction of the entire range of intensity associated with the 
sounds; third, the spatial and reverberation characteristics 
of the original sound must be preserved. 

As a result of the experiments described in this paper, it 
appears at the present time that a two-channel stereophonic 
magnetic tape sound reproducing system will satisfy the 
above conditions for obtaining a high order of realism in 
the home-type sound reproducing systems. 

Accordingly, home-type magnetic tape, two-channel, 
stereophonic-type reproducers have been commercialized; 
and prerecorded, two-channel, stereophonic tapes are now 
available for reproducing machines. 


SYSTEM FOR REPRODUCTION OF STEREOPHONIC 
MAGNETIC TAPES 


The elements of a two-channel system for reproducing 
two-channel, stereophonic magnetic tape is shown sche- 
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Fic. 18. The arrangement of a two-channel stereophonic tape 
reproducing system in a room showing the optimum relative di- 
mensions and the preferred listening area. 
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matically in Fig. 16. The system consists of a two-channel 
magnetic head, voltage amplifiers for each channel, volume 
controls for each channel ganged together, power amplifiers 
for each channel and loudspeakers for each channel. 

A perspective view of the commercial system for repro- 
ducing prerecorded magnetic tape is shown in Fig. 17. 
Cabinet A houses the two-channel magnetic tape repro- 
ducer, operating controls, amplifiers and the loudspeaker 
for channel A. Cabinet B houses the loudspeaker for chan- 
nel B and provides storage space for tape. 


PRERECORDED STEREOPHONIC 
MAGNETIC TAPES 


A catalog of music selections recorded on magnetic tape 
for stereophonic reproduction of sound in the home is now 
available. These tapes have been recorded for reproduc- 
tion in living room environments employing the principles 
outlined in the preceding sections. Any standard two- 
channel stereophonic sound reproducer can be used to re- 
produce the prerecorded magnetic tapes. 


REPRODUCTION OF PRERECORDED STEREOPHONIC 
MAGNETIC TAPES IN THE HOME 


A plan view of a typical living room setting for the re- 
production of two-channel stereophonic magnetic tapes is 
shown in Fig. 18. The arrangement and relative dimen- 
sions shown in Fig. 18 apply to practically any room in the 
average home. The ratios shown in Fig. 18 preserve the 
dimensional ratios for the most important subjective effect, 
namely, the angular distribution of the sound sources. 
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cial stereophonic magnetic tapes were carried out under the 
direction of Messrs. A. A. Pulley and W. H. Miltenberg. 
The development of the duplication process in the produc- 
tion of prerecorded magnetic tapes was carried out under 
the direction of Messrs. H. I. Reiskind and H. E. Roys. 
The development of the commercial stereophonic reproducer 
was carried out under the direction of Mr. J. L. Franke. 
A very brief description of the commercial tapes and sys- 
tems used for the reproduction of stereophonic sound in 
the home is given in this paper. A complete description of 
the commercial procedures and equipment developed for 
the reproduction of stereophonic sound in the home will be 
presented in papers by those responsible for this work. 
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Epitor’s Note. This British patent, a portion of 
which appears in the following pages, is reproduced 
with the kind permission of the Controller of Her 
Britannic Majesty’s Stationery Office. It is of 
historic importance in the development of stereo- 
phony. I have felt that since this issue of the 
JouRNAL is substantially devoted to this subject its 


appearance is timely and appropriate. When it is 
realized that many of the ideas, psychoacoustic, 
mechanical, and electrical, set forth in this docu- 
ment of 1931 are only now gaining wide, popular 
currency, one may well reflect on the magnitude of 
the economic forces which control the viability of 
inventions. 


British Patent Specification 394,325 


ALAN DoweER BLUMLEIN 


Application Date: Dec. 14, 1931 


Complete Accepted: June 14, 1933. 


COMPLETE SPECIFICATION 
Improvements in and relating to Sound-transmission, Sound- 
recording and Sound-reproducing Systems. 


The fundamental object of the invention is to provide a 
sound recording, reproducing and/or transmission system 
whereby there is conveyed to the listener a realistic impres- 
sion that the intelligence is being communicated to him over 
two acoustic paths in the same manner as he experiences in 
listening to everyday acoustic intercourse and this object 
embraces also the idea of conveying to the listener a true 
directional impression and thus, in the case in which the 


sound is associated with picture effects improving the illusion 
that the sound is coming, and is only coming, from the artist 
or other sound source presented to the eye. 

The invention is not, however, limited to use in connection 
with picture effects, but may, for example, be used for im- 
proving the qualities of public address, telephone or radio 
transmission systems, or for improving the quality of sound 


recordings. When recording music considerable trouble is 
experienced with the unpleasant effects produced by echoes 
which in the normal way would not be noticed by anyone 
listening in the room in which the performance is taking 
place. An observer in the room is listening with two ears, 
so that echoes reach him with the directional significance 
which he associates with the music performed in such a room. 
He therefore discounts these echoes and psychologically foc- 
uses his attention on the source of the sound. When the 
music is reproduced through a single channel the echoes 
arrive from the same direction as the direct sound so that 
confusion results. It is a subsidiary object of this inven- 
tion so to give directional significance to the sounds that 
when reproduced the echoes are perceived as such. 

In order that the physical basis of the invention can be 
appreciated and the stages of its development understood, 
known and established facts concerning the physical relations 
between sound sources, sound waves emitted thereby, and 
the human ears will be briefly summarized. 

Human ability to determine the direction from which 
sound arrives is due to binaural hearing, the brain being able 
to detect differences between sounds received by the two 
ears from the same source and thus to determine angular 
directions from which various sounds arrive. This function 
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is well known and has been employed to considerable extent 
for example in subaqueous directional detection in which 
two microphones are connected by headphones, one to each 
ear of an observer, the two channels between the micro- 
phones and the two ears being kept entirely separate. 

With two microphones correctly spaced and the two chan- 
nels kept entirely separate e.g. by using headphones it is 
known that this directional effect can also be obtained for 
example in a studio. If, however, the channels are not kept 
separate (as, for example is the case in previously proposed 
arrangements for recording and/or reproducing sound, in 
which sounds picked up by a plurality of pressure micro- 
phones are led to loud speakers which take the place of the 
headphones) the effect is almost entirely lost and such sys- 
tems have therefore not come into common use since they 
are quite unsatisfactory for the purpose. The present in- 
vention contemplates controlling the sound, emitted for 
example by such loud speakers, in such a way that the direc- 
tional effect will be retained. 

The operation of the ears in determining the direction of 
a sound source is not yet fully known but it is fairly well 
established that the main factors having effect are phase 
differences and intensity differences between the sounds 
reaching the two ears, the influence which each of these has 
depending upon the frequency of the sounds emitted. For 
low frequency sound waves there is little or no difference in 
intensity at the two ears but there is a marked phase differ- 
ence. For a given obliquity of sound the phase difference 
is approximately proportional to frequency, representing a 
fixed time delay between sound arriving at the two ears, by 
noting which the brain decides the direction from which the 
sound arrives. This operation holds for all frequencies up to 
that at which there is a phase difference of x radians or more 
between sounds arriving at the two ears from a source located 
on the line joining them; but above such a frequency if phase 
difference were the sole feature relied upon for directional 
location there would be ambiguity in the apparent position 
of the source. At that stage however the head begins to be- 
come effective as a baffle and causes noticeable intensity dif- 
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ferences between the sounds reaching the two ears, and it 
is by noting such intensity differences that the brain de- 
termines direction of sounds at higher frequencies. It has 
been stated that the frequency at which the brain changes 
over from phase- to intensity-discrimination occurs at about 
700 c.p.s. but it must be understood that this may vary 
within quite wide limits in different circumstances and from 
person to person, and that in any case the transference is 
not sudden or discontinuous but there is considerable overlap 
of the two phenomena so that over a considerable frequency 
range differences of both phase and intensity will to some 
extent have an effect in determining the sense of direction 
experienced. 

From the above considerations it will be clear that a direc- 
tional effect is to be obtained by providing impressions at 
the two ears of low frequency phase differences and high 
frequency intensity differences, and it would appear that in 
reproducing from two loud speakers the differences received 
by two microphones suitably spaced to represent human ears 
would give this effect to a listener if each microphone were 
connected only to one loud speaker. It can be shown how- 
ever that phase differences necessary at the ears for low fre- 
quency directional sensation are not produced solely by 
phase differences at two loud speakers (both of which com- 
municate with both ears) but that intensity differences at 
the speakers are necessary to give an effect of phase differ- 
ence: while initial intensity differences from the sources 
necessary for high frequencies are not sufficiently marked 
when the sounds reach the ears, and to produce suitable 
effects therefore the initial intensity differences must be 
amplified. It is for this reason that the aforementioned 
methods previously proposed (wherein only pressure micro- 
phones were used) are not succeessful in achieving the de- 
sired effect, these necessary alterations not having been 
understood or in any way attained in those prior arrange- 
ments. 

It will be seen therefore that the invention consists broad- 
ly in so controlling the intensities of sound to be, or being, 
emitted by a plurality of loud speakers or similar sound 
sources, in suitable spaced relationship to the listener, that 
the listener’s ears will note low frequency phase differences 
and high frequency intensity differences suitable for con- 
veying to the brain a desired sense of direction of the 
sound origin. In other words the direction from which 
the sound arrives at the microphones determines the 
characteristics (more especially, as will become apparent 
hereafter, the intensities) of the sounds emitted by the loud 
speakers in such a way as to provide this directional sensa- 
tion. 

It must be understood that the manual control by an ob- 
server of intensities of a plurality of loud speakers spaced 
round a motion picture screen has previously been proposed 
but this method suffers considerably from the defects indi- 
cated above, and in any case is very difficult and inconvenient 
to operate. No novelty for mere intensity control per se is 
however claimed, except insofar as the nature of the control 
is such as to provide the necessary relative phase and in- 
tensity difference sensations. 

If in accordance with the invention the sound is first 
recorded and subsequently reproduced from the records, 
the control may wholly be effected either during the recording 
or during reproduction, or may be partially carried out in 
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each stage. It must be understood that wherever throughout 
this specification the words “sound transmission” are em- 
ployed (more especially in the claims specified below), they 
cover (unless the context otherwise requires) not only the 
case in which impulses pass directly from the microphones 
to the loud speakers, but also those arrangements embody- 
ing an intermediate process or system of recording; and in 
the latter cases the said words apply to either, or both, the 
passage of impulses from the microphones to the recording 
system, and from reproducer to the loud speakers. 

More specifically the invention consists in a system of 
sound transmission wherein the sound is picked up by a 
plurality of microphone elements and reproduced by a 
plurality of loud speakers, comprising two or more direction- 
ally sensitive microphones and/or an arrangement of ele- 
ments in the transmission circuit or circuits whereby the rela- 
tive loudness of the loud speakers is made dependent upon 
the direction from which the sounds arrive at the micro- 
phones. 

The invention also consists in a system of sound trans- 
mission wherein the sound is received by two or more micro- 
phones, wherein at low frequencies difference in the phase 
of sound pressure at the microphone is reproduced as differ- 
ence in volume at the loud speakers. 

The invention further consists in a system of sound trans- 
mission in which the original sound is detected by two or 
more microphones of a type such as velocity microphones 
whose sensitivity varies with the direction of incident sound, 
and in which the dependence of the relative responses of the 
microphones to the direction of an incident sound wave is 
used to control the relative volumes of sound emitted by two 
or more loud speakers. 

The invention also consists in a system of sound trans- 
mission wherein impulses from two microphones transmitted 
over individual channels are adapted to interact whereby two 
sets of impulses are further transmitted consisting in half 
the sum and half the difference respectively of the original im- 
pulses, said impulses being thereafter modified to control 
the relative loudness of loud speakers whereby the sound is 
to be reproduced. 

The invention also consists in a system of sound trans- 
mission wherein the sound is picked up by two directionally 
sensitive microphones which are so spaced and/or with their 
axes of maximum sensitivity so directed relative to one an- 
other and to the sound source, that the relative loudness of 
loudspeakers which reproduce the impulses controlled by 
the direction from which the sound reaches the microphones. 

The invention also consists in a system as set forth above 
wherein two sets of impulses are mechanically recorded in 
the same groove. 

The invention also consists in a system as set forth above 
wherein the impulses are transmitted by radio telephony. 

The invention also consists in a system as set forth above 
in combination with means for the photographic recording or 
transmission and/or reproduction of pictures. 

The word channel, as employed herein means an electric 
circuit carrying a current having a definite form depending 
upon the original sounds in the studio. Thus two channels 
may be different not only because the average intensities or 
types of current in them differ but also because they originate 
from two microphones in different positions in the studio. 
The nature of the invention will become apparent from 
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the following description of various methods and modes of 
carrying it into effect but it must be understood that the 
different forms described are given merely by way of example 
and do not impose any restrictions upon the scope of the 
invention or the manner and means whereby it may be ac- 
complished. 

The description will be more readily understood by refer- 
ence to the accompanying drawings, wherein 

Figure 1 represents diagrammatically the assembly of one 
system according to the invention: 

Figure 2 represents a microphonic arrangement for use 
according to one form of the invention: 

Figure 3 represents a transformer arrangement employed 
in one form of the invention: and 

Figure 4 shows a symbolic representation of the arrange- 
ment shown in Figure 3. 

Figures 5, 6 and 7 represent various circuit arrangements 
applicable to various forms of the invention, while 


Figures 8, 9, 10 and 11 represent different forms of sound 
recorders which may be employed. 

It will be clear that the invention is particularly applicable 
to talking motion pictures and the following description will 
therefore be given with reference to this application. In 
one form of the invention convenient for this purpose shown 
in Figure 1 the sounds to be recorded and reproduced with 
the pictures may be received from a source a by two pres- 
sure microphones @,;, @2 mounted on opposite sides of a block 
of wood or baffle 6 which serves to provide the high fre- 
quency intensity differences at the microphones in the same 
way as the human head operates upon the ears as indicated 
above. The outputs from the two microphones are after 
separate amplification by separate similar amplifiers b,, b, 
taken to suitably arrange circuits c comprising transformers 
or bridge or network circuits which convert the two primary 
channels into two secondary channels which may be called 
the summation and difference channels. These are arranged 
so that the current flowing into the summation channel will 
represent half the sum, or the mean, of the currents flowing 
in the two original channels, while the current flowing into 
on each of two transformers, the secondary winding of 
the currents in the original channels. 
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One convenient transformer arrangement for this purpose 
is shown in Figure 3 wherein input currents from amplifiers 
b,, by are separately fed each to two primary windings, one 
on each of two transformers, the secondary winding of each 
each transformer, providing a “sum” or “difference” 
output current on account of the senses in which the primary 
coils are wound as shown. A diagrammatic representation 
of a sum and difference arrangement (which may consist of 
a transformer similar to that of Figure 3 or any other suit- 
able arrangement of circuit elements) is shown in Figure 4. 

In accordance with the form of the invention being de- 
scribed the two outputs from the sum and difference ar- 
rangement are modified in order to obtain subsequently the 
desired sound effects and one convenient circuit arrange- 
ment for effecting this is shown in Figure 5 which represents 
the portion of the circuits indicated by c in Figure 1. As- 
suming the original currents differ in phase only, the cur- 

TT 
rent in the difference channel will be — different in phase 
2 
from the current in the summation channel. This difference 
current is passed through two resistances d and e in series be- 
tween which a condenser f forms a shunt arm. The voltage 
across this condenser f will be in phase with that in the sum- 
mation channel. By passing the current in the summation 
channel through a plain resistive attenuator network com- 
posed of series resistances g, # and a shunt resistance i, a volt- 
age is obtained which remains in phase with the voltage across 
the condenser f in the difference channel. These two voltages 
are then combined and reseparated by a sum and difference 
process such as previously adopted so as to produce two 
final channels. The voltage in the first final channel will be 
the sum of these voltages and the voltage in the second final 
channel will be the difference between these voltages. Since 
these voltages were ir phase the two final channels will be 
in phase but will differ in magnitude. By choosing the value 
of the shunt resistance i in the summation channel and the 
shunt condenser f in the difference channel for a given fre- 
quency, any degree of amplitude difference in the final chan- 
nels can be obtained for a given phase difference in the 
original channels. For the low frequencies it can be shown 
that the phase difference between the waves will, for a given 
obliquity of the sound source, vary proportionately with 
frequency, being very small for a very low frequency. Thus 
for a given obliquity of the sound the current in the differ- 
ence channel will be increasingly great compared with that 
in the summation channel the higher the frequency. Hence 
the use of a shunt condenser f in the difference circuit will 
have the effect of producing a fixed intensity difference in 
the final channels for a given obliquity at all low frequencies. 


For the higher frequencies as indicated above it is not 
necessary to convert phase shifts into amplitude differences, 
but simply to reproduce amplitude differences. The shunt 
condenser f in the difference circuit is therefore built out 
with a resistance k whose value is substantially equal to 
that of resistance i. 


In building this circuit the capacity of the condenser f is 
of such value that its impedance is small compared with that 
of the series resistances d and e over the whole working range, 
while the value of resistance & is such that it equals the 
reactance of the condenser at approximately the frequency 
above which it is desired not to convert phase differences 
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into amplitude differences. The value of & is in general 
equal to that of i, in which case the amplitude differences for 
high frequencies are passed on without modification. 

It may be found necessary to employ more complex circuits 
than the shunt resistance & and condenser f in the difference 
circuit and shunt resistance i in the summation circuit, 
which however form the basic arrangement. However it 
must be understood that the circuits employed may be con- 
siderably modified as required without departing from the 
scope of the invention. 

The outputs from the modifying circuit c (Figure 1) are 
passed to amplifiers d,, d, and thence to loud speakers e,, é2 
suitably disposed on each side of a picture screen. It is to 
be understood that Figure 1 merely traces the passage of 
intelligence from the source a to a recipient and no record- 
ing or reproducing system has been shown. Such may how- 
ever be inserted anywhere along the electrical circuit such 
for example as between amplifiers 5,, b, and modifying as- 
sembly c, or between assembly c and amplifiers d,, do. 

In the latter case the impulses transmitted through the 
two channels as indicated above may for example be recorded 
on two sound tracks on a film by any suitable or known 
means, each of which records may comprise either a sound 
track of constant density and variable width (e.g. an oscil- 
lograph record), or a sound track of constant width and 
variable density (e.g. a light valve record). Alternatively 
both records may be made on a single track comprising a 
combination of the variable width and variable density 
forms of recording. 


Such a record may be reproduced by passing light from 
the same slit through the two tracks, separating the beam 
into the two record portions by means of prisms or like 
optical means and employing the outputs from two photo- 
electric cells, excited by these separate parts of the beam 
(after amplification) to operate two loud speakers disposed 
one on each side of the screen upon which the cinematograph 
pictures are projected. 

From the above description it will be clear that obliquity 
of the direction of sound wave propagation relative to the 
microphones @,, a, will produce differences of intensity at 
the loud speakers so as to give an impression to an observer 
of oblique sound incidence. 

If two very small microphones are used and placed very 
close together it may be found possible to obtain microphone 
outputs which do not differ appreciably in amplitude but only 
in phase for all working frequencies. In this case the modi- 
fying circuit may be arranged to convert phase differences 
into amplitude differences throughout the entire frequency 
range. The phase differences dealt with at the low-frequen- 
cies however may be so small that in this case slight differ- 
ences in the two microphone circuits would have very large 
effects. On this account microphone spacing of the same 
order as that of the human ears is most suitabie. 

It will be appreciated that the amount of modification 
necessary to the impulses transmitted through the summa- 
tion and difference channels as indicated above depends upon 
a number of factors, including the relative spacing of the 
microphones and of the loud speakers, and the size and posi- 
tioning of the screen. It can be shown that for low fre- 


quencies » the degree of modification required in the dif- 
ference channel as compared with the modification in the 
summation channel is given by:— 
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velocity of sound. 
= fraction of half picture film width which the image 
of the sound source is off centre. 
= angle of obliquity, in radians, of the source from 
the median plane between the microphones. 
effective distance apart of the microphones. 
width of screen of theatre. 
distance apart of loud speakers in theatre. 


This expression in effect gives the impedance of the shunt 
capacity f in the difference channel in terms of the resistance 
i in the summation channel. It holds for all frequencies 
where & is small compared to the wavelength, and is based 
on the assumption that the 6 is small and that x and s are 
small compared with the distance of the listener from the 
screen and loud speakers. 


= 
§ = 
z= 


¥ 
The portion — is a factor of the recording, and is con- 
Ok 
stant for a given arrangement if either the camera is in line 
with the microphones and the centre of the picture, or the 
action does not move appreciably to or from the microphones 
and camera. When recording, the relative distances of 
camera and microphones and the focal length of the lens 
may be adjusted to maintain this factor a constant. 
s 
The expression — is a constant for the theatre. 
x 


As re- 


gards low frequencies only, the distance apart of the speakers 
need not exceed the screen width, but should certainly not 
be closer than 70 percent. of the screen width. The closer 
the loud speakers the greater the necessary power handling 
capacity, but the less the troubles introduced by formation of 
stationary waves. 


For the high frequencies no definite expression can easily 
be obtained, and the modification, if any, used will probably 
have to be gauged empirically by trial and error. 


The argument and formula given above are based on a 
direct wave analysis and may have to be considerably modi- 
fied in order to allow for reflection of other acoustic effects. 
It is preferred therefore to introduce the modifications it is 
proposed to employ, at the theatre since all factors will then 
enter into consideration. It will be clear that, as indicated 
above, the modifying networks and channel arrangements 
may be employed between the microphones and the film dur- 
ing recording, or between the record and the loud speakers 
during reproduction, and the latter course, in addition to 
allowing of adjustment of the arrangements to suit the par- 
ticular theatre as indicated above, has the additional ad- 
vantage that the sound film can be reproduced by a single 
reproducing head or channel if, for example, one of the dual 
arrangements breaks down, or in a theatre which, having 
one installation, does not wish to go to the expense of install- 
ing a second apparatus. 


In order to employ successfully a system of the kind de- 
scribed above it is necessary to carry out preliminary experi- 
ments to determine the most suitable value of modification 
to be employed for each recording, and it is also necessary 


to standardise various factors entering into every recording. 
In the preliminary experiments, before recording, volume 
indicator measurements may be made with a standard sound 
source placed at the extremes of the “set”, ic. the space 
within which recording is to be effected, and from these the 
proposed modifying network laid out. A further experiment 
may also be effected to standardise phase angles on the film. 
At the theatre a simple adjustment may be provided to 
check and balance the input to the two channels, a length 
of test film being used for this purpose. It will thus be 
seen that the total theatre equipment necessary is very simple 
and consists in a transmission modifier (comprising two or 
four transformers, for example, artificial line resistances 
and the control network, which may be no more than a 
condenser and a resistance) and two normal sound-repro- 
ducing heads or pick-ups, or one specially designed head or 
pick-up adapted to separate the two recordings to two com- 
plete reproducing channels. There is no reason why the 
second channel used should not be the “stand-by” channel 
now often installed for safety since if, as indicated above, one 
of the channels breaks down reproduction may be continued 
without serious consequences on the other channel only. 

In connection with the standardization indicated above, 
while the binaural “transfer” frequency (from phase- to 
intensity-discrimination) need have no definite significance 
in recording, since it is a function of the human brain, it is 
nevertheless necessary to fix a change-over frequency from 
high- to low-frequency working for recording, since this fre- 
quency fixes the values of the elements in the modifier and 
thus the form of modification to be used, the distance apart 
of the microphones and the form of baffle between them. 
Any convenient frequency may be chosen as standard after 
experience has decided which is most suitable. Instead of 
standardising it may be possible from the preliminary ex- 
periments to allow electrically for variation of microphone 
positions and/or of microphone spacing (although the latter 
would be extremely difficult) and it must be understood 
that this arrangement falls within the scope of the invention. 

The above analysis is based upon considerations which 
take no account of sound reflections or interference during 
reproduction. The reflected sound waves which arise during 
recording will be reproduced with a directional sense and will 
sound more natural than they would with a non-directional 
reproducing system. If difficulties arise in reproduction 
they may be overcome by employing a second pair of loud 
speakers differently spaced and having a different modifying 
network from the first pair: or a row of speakers may be 
used with a composite, progressive modifying network to 
supply them: or the two speakers may be placed compara- 
tively close together. 


In this last arrangement the sense of direction of the ap- 
parent sound source will only be conveyed to a listener for 
the full frequency range for positions lying between the 
loud speakers; but if it is desired to convey the impression 
that the sound source has moved to a position beyond the 
space between the loud speakers the modifying networks 
may be arranged to reverse the phase of that loud speaker 
remote from which the source is desired to appear, and this 
will suffice to convey the desired impression for the low fre- 
quency sounds. With this arrangement of loud speakers 
close together, however, it would not be possible to effect 
a similar illusion in connection with high frequencies. 
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The system so far described employs to receive the sound 
waves two nondirectional microphones, e.g. pressure micro- 
phones. Directionally sensitive microphones may also be 
employed spaced a small distance apart, the outputs being 
modified as indicated so that the relative outputs of the 
loud speakers are controlled both by differences in phase 
and differences in magnitude of the microphone outputs. 
Such directionally sensitive microphones may be, but are 
not necessarily, of the type known as velocity microphones, 
and preferably provided with movable conductor elements 
so light as to move substantially as the surrounding air. 


Velocity or moving conductor microphones (e.g. moving 
strip microphones) are very suitable for any system accord- 
ing to the invention and in addition to use with circuit ar- 
rangements described above: they may also be employed 
with various alterations in the circuits. These microphones 
give a response varying as the cosine of the angle of incidence 
of the sound relative to the direction of normal or optimum 
incidence, and they therefore have the advantage that a cer- 
tain degree of loud speaker output separation may be ob- 
tained without phase-conversion or like network modifica- 
tions. 

Three general arrangements employing velocity micro- 
phones are possible and in all cases the microphones are 
placed as near together as possible instead of being spaced 
as artificial ears, as in the case of pressure microphones. 

(1) Two velocity microphones are placed one with its 
axis of maximum response directly facing in the direction of 
the centre of the scene, and the other with its axis at right 
angles to that direction. Both moving strips are in line, 
and arranged so that this line is vertical, whereas the sound 
source moves in a horizontal plane. A performer speaking 
from the middle of the scene will affect only the face-on 
microphone, but if he moves to one side both microphones 
will provide outputs, while if he moves the other way simi- 
lar outputs are provided but the phase of the edge-on micro- 
phone is reversed. Since the microphones are close together 
no phase differences are experienced between them and if 
their outputs are summed and differenced after a suitable 
amount of relative amplification the two final channels 
differ in magnitude in the correct manner for operating the 
loud speakers to give the desired directional effect. Such 
sum and difference arrangement differs from the modifying 
network employed with pressure microphones in that the 
pressure type provide phase differences (whereby direction is 
determined) which have to be converted, whereas with the 
velocity type the edge-on microphone provides an output 
proportional to the obliquity of the source. A suitable modi- 
fying arrangement for this form of the invention is shown 
in Figure 6. This is substantially identical with that shown 
in Figure 5 except that the shunt condenser f and resistance & 
in series, and the shunt resistance i are replaced by shunt 
resistances 1 m which are preferably variable as shown. 
These lines therefore form artificial attenuators and by 
altering their relative attenuation the intensity differences 
in the two lines corresponding to a given obliquity of sound 
is controlled. 

(2) Two velocity microphones or microphone elements 
may be placed with their axes perpendicular to one another 
and each axis at 45° to the direction of the centre of the 
screen. This arrangement is represented diagrammatically 
in Figure 2 wherein m and o represent two velocity, or direc- 
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tionally sensitive microphones one above the other arranged 
perpendicular to one another and at equal angles at 45° to 
the direction of the centre of the field from which sound is 
to be received. It will be clear that movement of the sound 
source a laterally to a position of removed from the centre 
of the field will result in the sound waves striking o at a more 
acute angle than they strike m and differences in the 
microphone outputs will result. The microphones are suffi- 
ciently close together to render phase differences of the inci- 
dent sound negligible and the output amplitudes therefore 
differ approximately proportionally to the obliquity of the 
incident sound. They may therefore be amplified similarly, 
and supplied directly to the loud speakers to which they 
will give the correct amplitude differences for the desired 
directional effect provided the relationship between the vari- 
ous dimensions of the recording and reproducing “lay-outs” 
are correct. If it is desired to accommodate any differences 
between the “lay-outs” the outputs may be modified by 
networks, in the manner described, suitably to increase or 
decrease the differences between them. An arrangement 
such as shown in Figure 6 is suitable for this purpose, and 
such an arrangement may of course also be employed even 
if the lay-out is correct if it is desired for any reason to 
control or modify the amplitude differences of the loud 
speaker outputs. 


(3) Two microphones may be arranged with the two 
axes lying symmetrically to the direction of the centre of 
the field and with an angle between them of say 6 degrees, 
so that sound from a performer at the centre subtends an 

6 


angle of — degrees to each microphone. If @ is small a 
2 


small movement of the performer to one side is sufficient 
to make one microphone “edge-on” and to reduce its output 
to zero, while if @ is large a large movement of the per- 
former is necessary to do this. By making @ adjustable 
different “layouts” may be accommodated without the 
modification indicated under (2) and it will be clear also 
that this provides a method of directional sound transmit- 
ting, recording and reproduction which avoids the necessity 
of combining and reseparating the two channels. 

The microphone elements in any of the above cases may 
be enclosed in a single casing if desired for convenience, 
and may also be positioned in a single magnetic system com- 
mon to both. Two velocity microphones set in line with 
one another and with their axes of maximum response sym- 
metrically inclined to the direction of the centre line of the 
scene, may, if placed one above the other, be employed 
also to provide significance of vertical as well as horizontal 
movement of the sound source in a plane perpendicular to 
the axis of maximum response of the microphone system. 
Such vertical displacement of the source will in this ar- 
rangement give phase differences to the outputs while lateral 
displacement gives amplitude differences, and these can be 
separated, the phase differences converted to intensity dif- 
ferences by modifying networks, as described, and the re- 
sulting impulses employed to operate four or more loud 
speakers distributed round the screen. The transmission 
in such a system occupies only two channels (one leading 
from each microphone) up to a point in the system where 
each of these channels is divided into two parallel channels 
thus providing four channels in all at this point. Two 
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channels, one from each parallel pair of these divided 
channels, are connected to one modifying network adap- 
ted to deal with phase differences, and the other two 
channels, one from each pair, connected to another modi- 
fying network adapted to augment intensity differences. 
Each modifying network operates a plurality of loud speak- 
ers providing a directional sensation in one direction, and 
in this manner directional senses in two directions at right 
angles can be obtained. It will be seen that in such an 
arrangement the transmission and/or recording (which is 
the most expensive and difficult operation of the system) 
may be effected over only two channels although directional 
sensations in two perpendicular directions are subsequently 
obtained. A similar effect may be obtained with a plurality 
of pressure microphones by employing suitable modification 
previous to transmission. 

In obtaining a complete directional “sound picture’’, i.e. 
both horizontal and vertical directional effects, the inven- 
tion is not limited solely to the use of two microphones. A 
plurality may be employed and their outputs suitably col- 
lected, modified and separated to transmit suitable differences 
of impulses to a plurality of loud speakers. The general fea- 
ture is that two transmitting channels, receiving impulses 
from two or more microphones for example, communicate 
impulses which can be modified and separated to provide 
two directional senses at right angles to one another, the 
sounds whereby this is done being provided by a plurality 
of loud speakers. It will moreover be clear that if the 


sound source moves away from or towards the microphones 
the overall intensity of the combined loud speaker propaga- 
tions will vary and thus provide indication of the position 


of the source along that axis. Full three-dimensional loca- 
tion of the source is thus obtained by this arrangement. 

It will be seen that while with pressure microphones it is 
preferred to transmit phase differences rather than amplitude 
differences and convert from one to the other as late as 
possible prior to reproduction with velocity microphones it 
is more convenient to transmit the two channels in phase 
but at different amplitudes, the only modification then neces- 
sary being an increase or decrease of the amplitude differ- 
ences should the reproducing “lay-out” differ from the 
recording “lay-out” or should more than two loud speaker 
positions be used. 

There is a simple method by which modifications for in- 
crease or decrease of differences between channels may be 
effected if no conversion of phase differences into amplitude 
differences is required. The method is particularly useful 
for the operation of more than two loud speakers, and is 
also useful for working into high impedances such as the 
grid impedance of a thermionic valve. The arrangement is 
shown diagrammatically in Figure 7. If the transmission is 
effected in the form of two channels r s of similar phase but 
different amplitudes, an alteration of these amplitude 
differences may be effected by connecting one wire of each 
channel r and s together at ¢ and connecting a choke wu be- 
tween the other two wires of the two channels. The outgoing 
channels v and w whose difference is to be a modification 
of the original difference, are connected by one wire each 
to the common point ¢ of the original channels, and by 
their other wires to tappings along the choke u. If the 
differences are to be increased, the tappings at which 
the output channels are connected lie outside the tappings 


to which the input channels are connected, so that the choke 
operates in effect as an auto-transformer amplifying the 
difference voltages. Similarly, for a reduction of differences, 
the output channels are tapped intermediately between the 
two input channels. Modifications of this arrangement in 
which the devices are balanced about earth, etc. may be 
arranged, but the chief advantage is that the modification 
is varied entirely by altering tappings along a transformer 
or choke, and that no great power loss is involved. 

This arrangement of a choke or transformer is well suited 
to working a number of loud speakers for binaural reproduc- 
tion. In this case, the two outputs from power valves are 
fitted to a choke such as u along which the loud speakers 
are tapped. The position of the loud speaker tappings can 
be adjusted to suit their relative positions, and it can be ar- 
ranged that the valves are working into their best imped- 
ances. Transformers may be used to ensure the speakers 
taking their correct fraction of the output. 

While, in connection with the above described systems, it 
is suggested that when it is desired to record sounds for sub- 
sequent reproduction this may be done upon a film, the in- 
vention is not limited to that medium since the recording 
may if desired be effected on discs or cylinders of suitable 
material. In carrying out the invention in this manner the 
two channels may if desired be recorded in separate grooves 
but it is preferred that they be recorded in the same groove 
having a hill and dale and also a lateral cut movement. For 
the purposes of television previous proposals have been made 
whereby a wax disc has a sound record as a hill and dale 
cut and a picture record as a laterally cut V-shaped groove 
at the bottom of the hill and dale groove, or vice versa. 
Such records appear unsuited for separate and distinct sound 
recordings since undoubtedly considerable cross-talk between 
the two recordings would occur. They can however be used 
for two channels of the kind contemplated in the present 
invention, one being only slightly different from the other, 
since a certain amount of cross-talk in this case does not 
matter, or can be allowed for. Furthermore, the records now 
proposed are distinguished from those previously known in 
that both channels may be recorded as separate cuts in one 
groove and may be recorded by a single recording tool 
(either of moving iron or moving coil type) and be repro- 
duced therefrom by a single reproducing device or pick-up. 

If the two channels being recorded are directly picked up 
from two microphones, or are intended to work unmodified 
into two speakers, that is with intensities and qualities 
similar to those of the original sounds received, it is pre- 
ferred not to cut one track as lateral cut and the other as hill 
and dale, but to cut them as two tracks whose movement 
axes lie at 45° to the wax surface, or at some other conven- 
ient angle dependent on the relative available intensities from 
lateral cut and hill and dale respectively. If, however, the 
two channels recorded are such as summation and difference 
channels, it is preferred to separate them completely into 
pure hill and dale and pure lateral cut, ie. to make the 
recording axes normal and tangential to the wax surface. 

The result in the two above suggested cases is very similar 
since channels recorded at 45° to the wax surface give their 
sum and difference as the effective lateral and hill and dale 
amplitudes. 

It will be appreciated that a record, cut as a combined 
hill and dale and lateral,-may be reproduced if desired as 
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two skew direction cuts, the basic principle being that the 
groove has amplitude in any direction in the plane at right 
angles to the direction of wax movement, and the recording 
and reproducing directions may be chosen as any pair of 
axes lines, not necessarily at right angles, in this plane. 

It would appear that for such a record, a material other 
than that now used for lateral cut records would be desirable, 
and a material of the nature of cellulose acetate is indicated. 

The track section is preferably adapted to work with a 
sapphire and have a sufficiently fine angle to give lateral as 
well as vertical control to the sapphire. 

The recorder whereby both channels may be cut by a 
single tool on the same groove may take various forms, the 
underlying feature being that a light stylus is pulled into 
two directions at right angles to one another and each pre- 
ferably at 45° to the wax surface. ee 

Figure 8 shows schematically a recorder of this kind 
suitable for producing records having complex cuts. 1 and 
2 represent the driving elements of two recorders normally 
adapted for cutting lateral cut records. These driving ele- 
ments drive arms 3 and 4 about axes at right angles to the 
plane of the paper within 1 and 2. The ends of these arms 
are connected by ligaments 5 and 6 to the end of a reed 7 
which extends backwards along an axis perpendicular to the 
paper to supports not shown. This reed carries a cutting 
sapphire 8. Movements of the recording arms 3 and 4 pro- 
duce movements in the end of the reed 7. Thus, currents in 
movement | will cause the reed 7 to move along an axis ap- 
proximately 45° to the vertical rising from left to right across 
the figure. Similarly, currents in movement 2 will produce 
movement of the reed 7 in an axis at right angles to the 
former axis, while currents in both movements will of course 
result in vertical movement of the reed. 

Another such form of recorder shown in Figure 9, repre- 
senting a moving iron recorder, may consist in a short reed 9 
mounted close above and parallel with the wax track and 
carrying the cutting sapphire 8. This reed 9 may extend 
backwards perpendicularly to the paper to supports (not 
shown) which join the top of a laminated pole system 10 to 
complete a polarising magnetic system therewith. The two 
laminated arms of the pole piece 10 extend down towards 
the free end of the reed 9. These arms form two poles adja- 
cent to a square portion of the reed at its free end, each pole 
being adapted to pull the reed in a direction at 45° to the 
wax surface. The reed may be suitably damped, e.g. by a 
rubber line and have a resonant frequency at the top of, or 
above, the working range. The two pole pieces may be 
wound with speech coils, and the energisation of one of these 
moves the sapphire in an upward direction at 45° to the 
wax surface. The terminals 15 of one channel are connected 
to main winding 12 and compensating winding 11. The 
terminals 16 of the other channel are connected to main 
winding 14 and compensating winding 13. C urrent in either 
channel will pull the reed towards the pole carrying the main 
winding, the purpose of the compensating winding being 
to prevent movement of the reed away from the other pole 
due to the flux drawn away from this pole by the main wind- 
ing. With the winding shown, currents in either channel 
will cause the reed to cut a track at approximately 45° to 
the vertical. By a suitable rearrangement of windings, or 
by a suitable transformer connection between the channels 
and the terminals of the recorder as shown, any other move- 
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ment axes may be obtained. Thus for example the tool may 
have one movement by torsion of its supporting reed and 
another by flexure thereof. 

An alternative moving coil design which may employ elec- 
tromagnetic damping may consist of a moving member in the 
shape of a T as shown in Figure 10. The recorder sapphire 
8 is supported on a light T member 17, which is supported 
at 18 by elastic means such that it may rotate about this 
point, and may also translate vertically, though it is resis- 
tant to horizontal movements in the plane of the paper. 
The device is driven by moving coils, e.g. speech coils, 19 
and 20 which are freely located and immersed in the steady 
magnetic field provided in annular gaps in a magnetic sys- 
tem, not shown. Current in one of the moving coils tends 
to both rotate and translate the device so that the sapphire 
8 moves along an axis at approximately 45° to the vertical. 
The movement of this device may be damped and equalised 
along the lines described in British Patent Specification No. 
350,998. As before any required axes of movement may be 
obtained by suitable interconnection of the two driving coils. 
Such a movement preferably has the same natural frequency 
for both rotation and translation. Further the distribution 
of mass is preferably such that a small instantaneous force 
applied at one coil produces no movement at the other. 

Figure 11 shows another form of recorder similar in 
principle to the one shown in Figure 10 except that a mov- 
ing iron drive is employed. The member 17 moving about 
axis 18 is constructed of magnetic material, or has a mag- 
netic upper portion. The “E” shaped member 21 is polar- 
ised either by being partially permanently magnetised, or 
having a magnetising winding on it, so that the central pole 
is of opposite polarity to the two outer poles. Speech wind- 
ings on the outer poles are brought out to terminals 15 and 
16 to which the two channels are connected. 

In all the devices described above, the angles of the 
axes defining the movements of the sapphire can be altered 
by suitably connecting the speech windings: for instance, 
axes which are normally inclined at 45° to the wax surface 
can be converted into pure hill and dale and lateral cut axes 
by arranging that the speech windings are in series aiding for 
one channel and opposing for the other channel. In like 
manner any axis conversion can be effected by suitably com- 
bining the channels through transformers. 

In designing an electric pick-up to reproduce both channels 
care must be taken that the inertia is kept as low as possible, 
and with this in mind a very light replica of any of the 
above described recorders may be employed. Preferably, 
a moving system in the form of a T following the lines of 
the moving iron recorder shown in Figure 11 is employed 
as best suited for the purpose. Since the fundamental reso- 
nant frequency of a pickup appears to be of no critical im- 
portance as regards its characteristic, it may not be necessary 
to adjust the resonant frequency in the two modes to the 
same value, which would simplify the design. Adjustments 
for sensitivity in the two modes may be made by suitably 
connecting coils wound on the two limbs of the magnetic 
circuits. As in the recorder design the distribution of mass 
in the reproducer is preferably such that forces producing 
motion in one direction (e.g. lateral movements) leave it 
substantially undisturbed in its reproduction by motions in 
another direction (e.g. hill-and-dale). 

(Continued on page 130) 
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Is Tape the Ideal Medium for Audio?” 


Ross H. SNYDER 
Ampex Corporation, Redwood City, California 


All sound recording media possess limitations in signal-to-noise ratio, frequency range, dis- 


tortion and economy. 
extremes than in the mid-range. 


Each is characterized by more severe dynamic limitations at the frequency 
The limits of present-day magnetic tape recording are explored, 


with special reference to slow-speed, high-information-density recording. 


IHE QUESTION which is the title of this discussion is 
answerable, if at all, only in a partial way. There is 
no ideal audio recording medium today, if by “ideal” we 
mean a recording medium which combines all the opera- 
tional and convenience features of each of the present-day 
methods, none of the disadvantages of any of them, and 
which will preserve faithfully the entire dynamic and fre- 
quency range which sensitive ears can perceive, adding no 
perceivable extraneous sound.'*:*+ 
The family of operating advantages which has made tape 
the preferred medium for record mastering, radio broad- 
casting, and stereophonic sound recording and reproduction, 
includes the following: 
. Re-usability 
. Spliceability - 
. Freedom from the effects of wear 
. Immediate replay, without processing or danger of deterioration 
. Permanence of original recording 
. Comparatively low per-minute cost of recording medium. 
Other audio recording media share some of these advan- 
tages, of course. Photographic sound recordings are, for 
example, spliceable; but photographic film is not re-usable, 
is highly susceptible to deterioration from normal wear, re- 
quires processing before replay; original photographic re- 
cordings are permanent only if preserved in almost never- 
played condition; and, of course, photographic film is costly 
in per-minute terms, compared with tape. 
Lacquer disc masters share only the advantage of com- 


* Received March 12, 1958. Presented before the Sixth Annual 
West Coast Convention of the Audio Engineering Society, Los 
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paratively low medium cost, and then only where re- 
usability is not desirable. 

But the tape medium also has disadvantages in its opera- 
tional and convenience characteristics: 

. It is not easy to find a given point inside a reel of tape, 
simply and quickly. 

. Tape is somewhat subject to breakage and stretching. 

. Tape threading is, in some applications, inconvenient. 

. All other things remaining approximately equal, tape repro- 
ducing apparatus is more costly. 

If we continue to set aside the question of ultimate re- 
produced sound quality, and assume, for the moment, that 
all the sound recording media are equal in this respect, the 
reasons for the preference for one or another of these in a 
given kind of service are quite plain. 

Magnetic tape is clearly the most satisfactory medium 
for the original recording of masters which are later to be 
copied and released in large numbers to the public, by rea- 
son of its spliceability, freedom from the effects of wear, 
immediate re-playability, permanence of original recording, 
and comparatively low medium cost. In mastering service, 
these advantages are compelling. 

Magnetic recording on oxide-coated sprocketed film, for 
ease of synchronizing with picture, is the preferred medium 
in motion-picture film recording, by reason of re-usability, 
freedom from the effects of wear, immediate re-playability, 
permanence of original recording, and comparatively low 
medium cost, in comparison with processed, non-replayable 
photographic film. These reasons for preferring the mag- 
netic medium are all, of course, separate entirely from the 
sound-quality advantages which are attributed to magnetic 
recording, in comparison with photographic. 

On the other hand, photographic sound recording is, ex- 
cept for multi-track stereophonic releases, the medium of 
preference for motion-picture release prints, because in this 
service it is much less costly to copy, and, in the practical 
situation, much less costly to reproduce in the theater or 
other commercial release situation. 


n 

u 

i / 

| 

EEE 
| _ 
| 
‘ie 
ee 
| 
. : g 
99 


20 100 500 1Ke = a 10 20 

Fic. 1. Relation between maximum useful output (upper curve) 
and cumulative noise (lower curve) for an average commercial 
tape. The curves were taken at 7% ips, half-track, and NAB 
equalization. 


While still ignoring any differences which may exist in 
the ultimate quality of sound reproduction, the single- 
channel disc release has certain unquestionable advantages 
over duplicated tape. One of the greatest of disc’s advan- 
tages is the sheer familiarity with the process. And, of 
course, quality for quality, the disc reproducer is somewhat 
cheaper to build and buy. The re-usability of tape is of 
importance to a small, but growing proportion of consumers; 
the same is true of its spliceability, and of its ability to be 
played back immediately after recording. In comparison 
with disc pressings, the basic tape, aside from the common 
cost of the music, is more costly. So, in the home, tape as 
it is now used is of interest mainly because of its freedom 
from deterioration due to wear, and because of its gift of 
the recording possibility to the consumer, so long as we set 
aside the question of its superior reproducing quality. 

Until very recently, tape has been the only suitable me- 
dium for the reproduction of stereophonic sound in the 
home. The successful demonstration of single-groove stereo- 
phonic disc pressings has been accomplished, however. So, 
still leaving out considerations of relative sound quality, we 
can begin to inquire concerning the relative merits of tape 
and disc for home stereo sound. Once more, the main ad- 
vantage of tape is its freedom from deterioration due to 
normal use, and its gift of the facility to record as well as 
reproduce, in the home. It is quite clear that the stereo- 
phonic disc pressing, if satisfactory in sound quality, has 
the same advantages over present-day stereo tape that the 
single-channel disc has over single-channel tape. 

In sober analysis, however, we cannot assume that the 
reproduced quality of these two media is identical, and we 
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cannot assume that tape mechanisms cannot be improved 
in the direction of easier threading, greater security against 
tape-breakage, and lower cost. 

Tape has several great advantages to offer the consumer, 
even in its present form. The ability to replay hundreds 
or thousands of times without audible deterioration, yet to 
be erased and re-recorded at will, is prime among these. 
Looking to the future, there is every reason to expect that 
threading and tape-handling will be greatly simplified, per- 
haps through the development of some automatic threading 
device, or fully-enclosed cartridge. If present technology is 
any indicator, tape speeds may come down, permitting a 
reduction in per-minute medium cost. The “Videotape” 
recorder has clearly proved that recorded signals of half 
the former minimum wavelength are commercially usable, 
and that higher values of bandpass times s/n ratio are 
obtainable. 

Tape has always had the advantage of requiring no me- 
chanical parts which must vibrate in precise unison with 
the transmitted signal, eliminating thus the complex mass- 
compliance problems which are inherent in a system using 
such moving parts. Tape’s constant linear velocity removes 
the problems which surround the recordist in handling | 
inner-groove distortion and dynamic limitations. 

Perhaps the most appealing of the advantages of tape to 
the designer is the definability of its performance. From 
beginning to end, a recording on tape is constant in several 
respects: 

1. The uppermost undistorted recording level, at any given 

frequency. 

2. The noise spectrum. 

3. The gentle overload characteristic. 

4. The frequency response characteristic. 

It is possible to express all this in a single graph, and 
thus to define the working area for the recordist. Figure 1 
demonstrates this. The uppermost line shows the curve of 
maximum recordable intensities. This curve was deter- 
mined from the following considerations: It could not be 
simply a curve of the output level at constant percentage 
distortion due to the approach of tape overload, because 
single-tone distortion analysis at medium and high audio 
frequencies would not include, inside the bandpass of stand- 
ard measurement, the significant distortion partials. The 
tape, which is one selected so as to be median among com- 
mercial tapes® in its sensitivity to short-wavelength record- 
ing, was first recorded to that level at which distortion was 
3% due to the approach of saturation, at 1000 cps. It was 
then recorded fully to saturation at that frequency. The 
level of fundamental tone, excluding all distortion products, 
in the saturated recording, was then measured, and com- 
pared with the level of fundamental in the 3%-distorted 
recording. The level-difference between these two was de- 
termined. Then tones of all frequencies, from 30 to 15 000 


5 J. Leslie, “Non-Uniformity of Magnetic Recording Tapes,” un- 
published dissertation. Presented before A.E.S. Convention, Los 
Angeles, March 28, 1955. 
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1S TAPE THE IDEAL MEDIUM FOR AUDIO? 


cps, were recorded beyond saturation on the tape and the 
level of maximum recoverable fundamental on each deter- 
mined. This figure was reduced, in each case, by the differ- 
ence between maximum recoverable fundamental at satura- 
tion and maximum recoverable fundamental at the 3% level, 
in the 1000-cycle recording; the resulting figure thus being 
expressed on the graph. This method assumes, therefore, 
the very worst condition, relating a level of every frequency 
which is reduced from the total output, to the total of noise. 
Note that this still gives us, in an original 7'2-ips half-track 
recording, an attainable s/n ratio of 60 db. 

The lowest line on the graph displays the accumulating 
total of noise, measured in half-octaves from the lowest to 
the highest frequencies. Thus, at any frequency of the 
graph, the noise figure is the total of the noise in that half- 
octave, plus the noise in all lower half-octaves. The noise 
in the whole spectrum is, of course, at the level shown at 
the highest frequency. 

No such simple chart can be prepared for the disc record- 
ing for several reasons: 


1. The uppermost reproducible level, at any given frequency, 
from a disc pressing, is not determined by the angular 
velocity of the groove, but by the characteristic of the 
whole reproducing stylus-cartridge-arm-turntable complex, 
and may not be the same in any two reproducing situations. 

. The uppermost recordable level varies with the diameter of 
the groove, declining significantly toward the center of the 
disc. 

. The noise spectrum is not constant, being heavily affected 
by the presence of dust and the effects of wear, and, likewise 
is not equal from outer to inner groove. 

. Distortion in the reproduction at any given level and fre- 
quency depends upon the variable characteristics of the re- 
play mechanism, including the state of its maintenance at 
the time, and the relative diameter of the groove. 


It is, thus, impossible to make direct comparisons in 
sound-quality between disc and tape, since s/n ratio, dis- 
tortion, character of noise and frequency-response of the 
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disc all vary within rather wide limits, depending on the 
history of the particular pressing, the condition and type 
of reproducer, and the diameter of the groove. 

It may, then, be said in summary that there is no entirely 
ideal audio recording medium, whether for professional or 
consumer purposes, except that tape has outstanding ad- 
vantages in re-usability, spliceability, freedom from deterio- 
ration with use, permanence of recording, and clearly de- 
finable sound-reproducing characteristics. It has disadvan- 
tages, such as inconvenience of threading, the possibility of 
breakages, the difficulty of spotting inside the recording, 
and somewhat higher medium and equipment costs, at least 
in consumer service. It is clear that tape has advantages 
which cannot readily be matched by technological progress 
in the other media, and that its disadvantages appear sus- 
ceptible to great improvement, through new tape-packing 
devices and improved tape-handling mechanisms. The 
higher cost of the tape and its reproducing equipment may 
also reasonably be expected to come down gradually, as 
wider usage encourages mass production, which, in turn, 
justifies the application of more research toward lower cost 
production methods. 
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theory, and psychoacoustic basis of the stereophonic effect makes this, in the Editor’s opinion, an exceed- 
ingly valuable reference source. 


Its concise presentation of the history, 


THE ‘STEREOSONIC’ RECORDING AND REPRODUCING SYSTEM 
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before the RADIO AND TELECOMMUNICATION SECTION 20th February, 1957.) 


SUMMARY 

The paper reviews briefly the history of stercophonic reproduction. 
The principal basic systems with their underlying ideas are described 
and compared. Some account is given of the supposed mechanism 
of natural binaural listening from the viewpoint of-direction localization. 

The principles and practice are discussed of a particular system for 
domestic use, derived from the carly work of A. D. Blumlein, and 
characterized by the use of spaced loudspeakers driven in phase, to 
which the name ‘stereosonic’ has been given. The aims of this system 
are defined, and the mathematical theory involved in its use is developed. 
Limitations and sources of error in the results achieved are described. 

Equipment used in the making of master recordings and some of the 
problems of studio technique involved are described. Consideration 
is given to the form which a domestic stereophonic record should take, 
and the standards to which such a record should conform, together 
with the requiremenis which these impose on the reproducing 
equipment. 


LIST OF PRINCIPAL SYMBOLS 


6 = Angle between radius vector of the sound source 
and the median plane. 
6, = True angle. 
6, = Apparent angle. 
h = Effective human-ear spacing. 
v = Velocity of sound in air. 
¢ = Phase difference of sound at the listener’s two 
ears. 
w = Angular frequency. 
ys = Semi-angle subtended at the listener by a pair of 
loudspeakers. 
» = Difference of the time of arrival of the sound at 
the listener’s two ears. 
Land R = Peak amplitudes at listener’s ears due to left and 
right loudspeakers. 
x = Half distance between loudspeakers. 
r = Projection of the listener’s displacement from the 
median plane on to the loudspeaker base-line, 
expressed as a fraction of x. 


d = Distance between loudspeaker base-line and the 
listener. 

y, = Fractional distance of apparent source from 
central position. 

y, = Fractional distance of true source from central 
position. 

B and y = Angles subtended by the left- and right-hand 
loudspeaker with respect to the median plane 
of the listener. 

7 = Time-of-arrival difference of the sound from left- 
and right-hand loudspeakers at the listener 
caused by path differences. 

ft, and jg = Semi time differences at the listener’s ears due to 
sounds from left and right loudspeakers. 

¢, and dz = Resultant phase of the sound at the left and 
right ears of the listener relative to the centre 
position. 


(i) THE IMPORTANCE OF STEREOPHONY IN 
REPRODUCTION 

Normal listening is always binaural, and the two ears are 
used in conjunction by the brain to interpret the sounds heard. 
If all the individual sound sources, such as voices and instru- 
ments, and the reverberant sounds, are recorded or transmitted 
by a single channel before being reproduced by one or more 
loudspeakers, it is not possible to use the binaural sense of 
location to differentiate between the venees sounds as is done 
in direct listening. 

In order to add some spatial realism. using only one channel 
a number of schemes have been tried, such as the use of spaced 
loudspeakers supplied through frequency-dividing networks, or 
several loudspeakers fed through delay networks, but none of 
these is universally successful on all types of programme. 

An honest assessment of the sound quality obtainable through 
a single channel, in which the frequency, harmonic, inter- 
modulation and noise distortions have been reduced to the limits 
detectable by the human ear, reveals that the reproduction is 
still far from lifelike. The addition of real binaural listening 
conditions seems to be the most likely factor remaining. 


SOUND 
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THE ‘STEREOSONIC’ RECORDING AND REPRODUCING SYSTEM 


Many tests have now been made which all show that, if a 
system is used in which the relative sounds at the two ears are 
each reproduced as similarly as possible to those heard in direct 
listening, i.e. with the correct relative amplitude and phase 
differences for each sound source, the reproduced sound takes 
on a new quality which is nearer to realism than can be obtained 
in any other way. 

For a given amount of ‘ambient’ or reverberant sound, it is 
possible to obtain better definition, i.e. ability to distinguish 
individual instruments or voices in a chorus. A soloist can be 
heard clearly through an accompaniment, without having to 
resort to making the soloist appear unnaturally close to the 
listener. 

The problem of hearing clear bass parts free from the unduly 
long low-frequency reverberation which is so commonly 
encountered, is considerably eased, and full bass response 
without ‘boom’ can ‘be obtained. It is noticeable that the 
reproduced sound level can be raised higher than with single- 
channel reproduction before the average listener becomes 
irritated. The ability to do this permits a closer reproduction 
of the original dynamic range of the music in cases where this 
is great. It also seems possible to obtain a given subjective 
loudness for less acoustic power than with single-channel 
operation. 

All this is quite aside from any benefits which may accrue 
from the ability to reproduce illusion of movements and the 
spatial relationships so essential to drama and opera. 

The requirements then seem to be to reproduce, with reasonable 
economy, in a system suitable for home use, the same sounds at 
each ear separately which would have been heard by direct 
listening in an optimum position in front of the performers. 
This requirement can be limited to sounds arriving from direc- 
tions covering an angle of, say, 90° in front of the observer for 
most practical purposes. No method has yet been devised to 
meet this requirement from all horizontal directions, but neither 
this nor the need for the indication of vertical angular direction 
seems to be of major significance. 


(2) HISTORY 

An explanation of the principles by which the normal human 
being uses both his ears for the purpose of spatial location has 
been sought since the classic studies in acoustics of the mid- 
nineteenth century. Attempts to reproduce this natural ability 
at a distance were made early in the development of telephony. 
One of the earliest accounts was given in L’Electricien of 1881 
by E. Hospitalier, who described the sound system installed in 
the Paris Opera House.' This basically consisted of a pair of 
microphones spaced on opposite sides of the stage, which 
supplied at a distance a number of pairs of telephones held to 
the ears of the listeners, who received an impression of 
localization. 

It was probably not for many years, until sound reproduction 
for entertainment purposes was receiving more attention, that 
any serious development work along these lines was undertaken. 
It was well understood, however, that the use of two similar 
microphones spaced by a distance equivalent to that between a 
human being’s two ears, operating through two separate channels 
into a pair of headphones, did, in fact, give a remarkably accurate 
impression of spatial location. The use of headphones never- 
theless robs the observer of the ability to locate sources outside 
the horizontal plane and prevents discrimination between front 
and rear, since in these cases it is necessary to move the head 
relative to the sound field. 

In the 1920’s it became customary to use loudspeakers for 
sound-reproducing systems instead of head telephones, and the 
problem of reproducing sound with directional sense imme- 
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diately became more complex, for reasons which are discussed 
later in the paper. 

One of the earliest workers in this field was the late A. D. 
Blumlein. He was convinced at this time that the main contri- 
bution towards the sense of spatial location was provided by 
the difference in time of arrival at the two ears which, at low 
frequencies, can be interpreted in terms of phase differences. 

He invented a system? by which the phase difference between 
the outputs of two similar microphones spaced by a distance 
small compared with the wavelength, can be converted into two 
in-phase outputs of different amplitudes driving two loudspeakers 
spaced by an appreciable distance. If the relative amplitudes 
corresponding to a given phase difference at the microphones 
are correctly proportioned, the phase differences at the two 
ears of the person listening to the two loudspeakers will be a 
close simulation of the phase differences which the listener 
would have heard had he been at the microphone position. 

Blumlein showed how the correct relative loudspeaker inputs 
could be derived from two closely spaced pressure microphones 
by the use of suitable modifying networks which he called 
‘shuffling’ networks. He also showed how velocity microphones 
could similarly be used, and solved the mathematical relationship 
between the various parameters in order to obtain the correct 
apparent position of the source with any given loudspeaker 
arrangement. He also described a means of recording two 
channels on a single disc, and, in fact, disc records of this type 
were pressed and reproduced in 1933. Experimental motion- 
picture films were made incorporating twin-track optical 
recordings, and these were reproduced along with large-screen 
projection in 1935. The industry was not prepared to introduce 
the wide screen at this date, and therefore further work on the 
application was abandoned. 

Blumlein’s method has the advantage of being a ‘free field’ 
system, the observer being able to move his head without pro- 
ducing a corresponding movement of the apparent source. It 
relies on the interaction of sounds from the two loudspeakers 
at the listener’s ears, and cannot utilize headphones. 

One of the first successful public demonstrations of stereo- 
phonic sound reproduction was given? by Bell Laboratories in 
America in 1934. The system used three microphones at the 
centre and extremities, respectively, of the sound stage, and they 
drove, through three identical channels, three loudspeakers 
similarly disposed on the reproducing stage. Using such a 
three-channel system, fairly accurate location was obtained in 
the azimuth sense, and comparatively effective location was 
possible in depth. In 1939 R.C.A. used a similar arrangement 
with three channels for the stereophonic recording of a sound 
film entitled Fantasia. 

Since the war demonstrations have been given by Philips at 
Eindhoven of a system using two microphones placed in an arti- 
ficial head, the outputs of which supply two widely spaced loud- 
speakers. In such a system the phase differences of the 
microphone outputs become of no particular significance when 
reproduced at the two spaced loudspeakers, but the relative ampli- 
tudes of the microphone outputs due to masking by the artificial 
head are reproduced at the loudspeakers, and provide a measure 
of spatial location to an observer in front of them. Such an effect, 
of course, can only take place at the upper frequencies, for 
example those above about 700c/s, and any directional effect at 
the low frequencies is neglected. 

More recently several American companies have made 
recordings using two widely-spaced microphones driving spaced 
loudspeakers via the medium of twin-track magnetic tapes, and 
one has issued some discs carrying the two channels, one on the 
outer half and the other on the inner. This has the serious 
disadvantage of halving the playing time. 
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The advent of magnetic tape, providing two synchronous 
but otherwise independent channels, made possible the com- 
mercial exploitation of Blumlein’s work, resulting in the 
‘stereosonic’ system described in this paper. 

This was demonstrated to members of the profession and 
representatives of the Press in April, 1955. In April, 1956, a 
full-scale public demonstration at the Royal Festival Hall was 
given to an audience of 1 800. 


(3) COMPARISON OF BASIC SYSTEMS 
In the preceding brief historical review the systems employed 
fall into three basic types. These are described below. 


(3.1) The ‘Wavefront’ System 

If an infinite number of microphones, placed in a vertical 
plane between the source of sound and the listener, were to be 
connected to an infinite number of loudspeakers in corresponding 
positions, clearly the radiated wave could be reproduced unaltered 
and true binaural audition would be preserved. 

A line, rather than an area, of microphones and loudspeakers 
would give perfect location in a horizontal direction, which is 
adequate for most purposes since location in a vertical plane 
seems to be impossible except by inclining the head. 

An approximation to this condition, with a corresponding 
limitation in the accuracy of the results, can be obtained using a 
finite number of microphones and loudspeakers. Contemporary 
film sound systems use several channels in this manner. The 
system described in Reference 3 also used this principle. 

Fig. 1 shows the practical arrangement. Detailed tests were 
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Fig. 1.—Arrangement used by Bell Laboratories in 1934. 


made with a loudspeaker located at one of nine positions in the 
transmitting studio, and the apparent position as estimated by 
observers in the listening studio corresponded fairly accurately in 
breadth, and to a reasonable degree in depth, when the observers 
were not too far from the centre-line. Very-high-fidelity channels 
were used, and the quality of results obtained by this system was 
said to be better than anything ever heard prior to that date. 
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The use of three independent amplifier and loudspeaker 
channels, although suitable for public use, is quite uneconomical 
for domestic use. The ultimate simplification of the wavefront 
system to two channels can, under certain conditions, give 
pleasing results. For this purpose, two microphones are placed 
about ten feet apart in front of the sound source, driving two 
identical amplifying and recording or transmitting systems with 
two loudspeakers at a similar separation. There is a decided 
tendency, however, for the sounds to appear to be coming from 
the two separate speakers, with a gap in between in which the 
sound is weak. 


(3.2) The Reproduction of Correct Sound Pressures at the Ears 

A somewhat different approach to that of the wavefront con- 
ception is to consider local conditions at the observer’s two ears. 
A system has been devised which relies on a consideration of 
these differences. The approach is based on the assumption 
that low-frequency sounds play little or no part in directional 
localization and that the principal clue to direction is the intensity 
difference at high frequencies produced by the shadowing effect 
of the head. It is shown that, if two pressure microphones are 
used as artificial ears in a dummy head, intensity differences 
between the outputs occur in accordance with the classic measure- 
ments described in Reference 3. It is then demonstrated that, 
if these outputs are applied to a pair of widely spaced loud- 
speakers placed symmetrically with respect to an observer, 
intensity differences do occur at his ears, although they are not 
as great as the original differences at the microphones. In 
connection with this system, attempts have been made to show 
that time differences are less effective than intensity differences 
in producing impressions of directional localization, and that 
incorrect time differences can be compensated by modifying the 
intensity differences. It is then argued that, since the major 
factor in localization, i.e. intensity differences at high frequencies, 
is reproduced, natural binaural listening is simulated. 

The consideration of local conditions at the ears forms the 
basis of the ‘stereosonic’ system, described in detail in Section 4. 
This springs from the original Blumlein invention. It recognizes 
that, since each loudspeaker communicates with both ears, 
differences in magnitude of the sound pressures at the loud- 
speakers at low frequencies produce phase and not magnitude 
differences at the ears, since the contributions from the two 
loudspeakers arrive at slightly differing times. A pair of direc- 
tional microphones is employed, effectively at a single position, 
to produce two outputs in phase, differing in amplitude according 
to the direction of the sound source. These are applied to a 
pair of spaced loudspeakers so as to produce at the ears a time 
difference independent of frequency at low frequencies, and an 
intensity difference using the shadowing effect of the head at 
high frequencies. It is claimed that this represents the nearest 
approach yet made to natural listening conditions. 


(3.3) Pseudo-Stereophonic Systems 

No detailed description will be given of systems in which a 
single microphone output supplies two or more recording 
channels at relative levels which are controlled manually in the 
dubbing stage. Such means are used in the dialogue sequences 
of wide-screen motion pictures and give an illusion of move- 
ment to single voices, etc., but clearly cannot give a simultaneous 
representation of the direction of a large number of sources as 
is required for most musical programmes. 


(4) THEORY OF THE ‘STEREOSONIC’ SYSTEM 


Before discussing the theory in detail, it will be advisable to 
review briefly what is known of the mechanism of spatial auditory 
location in the human being. 
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THE ‘STEREOSONIC’ RECORDING AND REPRODUCING SYSTEM 


(4.1) Mechanism of Angular Localization 


Rayleigh* in 1896, and Stewart® in 1920 both carried out 
experiments which demonstrated that intensity differences at 
the ears were insufficient to account for location at the lower 
frequencies, and that the phase differences had to be taken into 
account, although above about 1 000c/s the intensity differences 
were necessary to avoid ambiguity. In spite of this and the work 
of others, there has been some reluctance in many quarters to 
accept the importance of phase differences. 

Banister® in 1931 and the Medical Research Council’ in 1932 
give prominence to the idea that the time difference rather than 
phase difference may be the element detected, in which case 
there is no need for limitation of the effect to the lower frequencies. 

In spite of this, and much other and more recent work, no 
exact explanation can yet be given for the mechanism of sound 
location. The authors believe that the two principal quantities 
used by an observer to estimate the angle of arrival of a sound 
wave are, first, the difference in time of arrival of a wavefront, 
and secondly, the difference in intensity at the two ears. Of 
these, by far the more important is the time difference. For 
sinusoidal waves, a constant time difference at the two ears is 
equivalent to a relative phase difference proportional to the 
frequency of the original sound. At low frequencies, where this 
phase difference is less than, say, 7 radians, the direction of 
arrival could be deduced from it. For such a deduction to be 
possible, the times of passage of each wavefront past both ears 
must be identifiable to the observer. As the frequency is 
increased, i.e. the wavefronts follow each other more closely, 
the point is eventually reached when a wavefront arrives at one 
ear before the preceding one has reached the other. Since there 
is nothing to distinguish between the wavefronts, ambiguities 
arise and it is impossible to interpret the observed time differences 
uniquely in terms of direction of arrival. These ambiguities 
start to occur when the ear spacing becomes equal to a 
half wavelength. At higher frequencies the head becomes an 
appreciable obstacle and produces an intensity difference, the 
magnitude of which can be used to assess the direction of arrival. 
It may also assist in resolving the ambiguities mentioned, and 
allowing the observed time differences to be interpreted at 
higher frequencies. Recent work in America® shows that the 
direction of arrival of pure tones can be judged unambiguously 
with reasonably constant accuracy up to 1200c/s. At this 
frequency the ear spacing approximates to a whole wavelength, 
suggesting that the first ambiguity due to confusion of wave- 
fronts is overcome, possibly with the assistance of intensity 
differences. In this case the brain has to choose between two 
possible directions, widely separated and on opposite sides of the 
head. 

Where the sound waveform is complex, as in the majority of 
natural cases, the shape of the modulation envelope probably 
assists in identifying the wavefronts and making possible the 
interpretation of observed time differences. 

With pure tones at high frequencies, when it is necessary to 
tely on intensity differences to judge direction, accuracy 
deteriorates, since the amplitude differences at the ears, charac- 
teristic of the source position, may be modified by stationary 
waves and reflections from walls and other obstacles. 


(4.1.1) Cochlear Response. 

When a sound wave arrives at the ear, the immediate result is 
the production of an electrical waveform corresponding to that 
of the instantaneous sound pressure. This is known as the 
‘cochlear response’, and it can be detected by amplifying the 
potential differences between suitably placed electrodes. The 
signal, in this form, is unsuitable for analysis by the brain, 
and its primary purpose seems to be to initiate an electrochemical 
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response in adjacent nerve fibres containing the original infor- 
mation in pulse coded form. 


(4.1.2) Action Potential. 

This secondary signal is called the ‘action potential’ and it 
differs strikingly from the cochlear response. It consists of 
short pulses, of constant duration and amplitude, apparently 
occurring at random intervals. The average frequency of these 
pulses is related to the intensity of the original stimulus rather 
than its frequency. This latter is determined in some other 
way, possibly by observing which part of the basilar membrane 
is responding, although it must be admitted that pitch dis- 
crimination is more accurate than can be accounted for by 
simple resonance. Observations on single nerve fibres’ show 
that successive pulses are always separated in time by an integral 
multiple of the period of the stimulus and that they occur at a 
particular point on the cochlear response waveform. 

When a pulse has been initiated the particular fibre in which 
it occurs remains inactive for a short interval, known as the 
‘refractory period’, which varies according to the strength of the 
stimulus. This mechanism limits the pulse rate in a single fibre 
to a few hundred per second. There are, however, many such 
nerve fibres associated with each ear, and it is known that at 
least one fibre will respond at each cycle of the incoming stimulus, 
up to a frequency of 1 500c/s, perhaps higher. 

Provided, therefore, that the brain can recognize pairs of 
pulses produced at the two ears by the same sound wave, the 
original time-difference information is available to it. 


(4.1.3) Time-Comparison System. 

It is clear that, if the above reasoning is correct, each cycle 
of the incoming wave gives rise to a pulse of action potential 
up to frequencies of about 1500c/s, and that such pulses occur 
at a definite point in the cycle. This happens at both ears, and 
if there were some means of measuring the time differences 
between their production, the angle of arrival of the sound 
could be deduced. A theory has been put forward by Jeffress'® 
which suggests how this is done. 

It must be noted first that transmission of a pulse along a 
nerve fibre is not simple electric transmission like that along a 
telephone cable. It is electrochemical in nature; pulses travel 
without attenuation as they are self-regenerating, and at a 
relatively slow speed. The rates of transmission as measured do 
not exceed about 10*cm/s, i.e. several times slower than the 
velocity of sound in air. 

The theory then postulates the existence of a nerve combina- 
tion such that one nerve requires to be stimulated by two others 
simultaneously before it will respond. Engineers will recognize 
this combination as being analogous to the ‘logical-and gate’ 
ubiquitous in computer circuits. It is suggested that a 
number of these nerve combinations are spread across the 
brain and stimulated by pairs of nerve fibres of appropriate 
length, connected to the twoears. Fig. 2 shows this schematically, 
in engineering terms, remembering that transmission times are 
proportional to the physical lengths of the different connections. 
A response from one of these ‘gates’ corresponds to a pair of 
pulses separated by a definite time interval arriving from the 
two ears. Since the lengths of the nerve fibres are significant, 
it is clear that the position of the nerve combination in the brain 
associates it with a definite time interval between stimuli, and 
thus with a definite direction of arrival of the original sound. 
In the Figure, a response from the left-hand gate corresponds to 
a sound wave arriving from the right side of the observer and 
vice versa. : 

Some evidence in support of this theory has recently been 
published. Experiments on cats are described'' in which clicks. 
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Fig. 2.—Schematic of possible time-comparison system. 


separated by a known time interval, were supplied to the two 
ears independently and the response on the left and right lobes 
of the brain observed. Maximum response was obtained from 
the right lobe when the click supplied to the left ear was advanced 
by a time corresponding to a sound coming from the left of the 
subject, and vice versa. 


(4.1.4) Application of the Theory. 

The physiology of the brain is insufficiently known to confirm 
or deny the existence of such a mechanism as that postulated, 
nevertheless its extreme simplicity makes the idea attractive. 
The theory forms a useful working basis, and many of the 
observed facts can be fitted into it. It demonstrates clearly the 
importance of the first wavefront to arrive at the ears; this has 
the best chance of triggering a pulse, since the inhibitory 
mechanism is inoperative. The difficulty of localizing . pure 
tones at high frequency is also shown. Pairs of pulses originating 
from different cycles of the incoming wave will produce spurious 
responses from the gates, which the brain will have to discard. 
It is thought that, under these conditions, intensity differences 
play an important part. With complex high-frequency wave- 
forms, however, pulses will tend to be grouped round prominent 
features in the waveform, and this will again enable the time 
comparison mechanism to operate. 

Altogether, the best chance of a clear direction seems to be 
at those lower frequencies where the action-potential pulses 
can only be paired in one way to give a time difference corre- 


sponding to a possible angle of arrival. This does not mean 


that the most accurate indication will be obtained at the lowest 
frequencies. The slower rate at which the waveform crosses 
the zero-pressure axis and the higher threshold as shown in the 
Fletcher—Munson curves combine to produce a loss of resolution 
that appears as an error in estimated direction. Nevertheless 
the sense of direction is very strong at low frequencies, and the 
authors do not subscribe to the view that low frequencies play 
no part in angular localization. 


(4.2) The Aim of the ‘Stereosonic’ System 


The aim of the ‘stereosonic’ system is to reproducé at the ears 
of the listener, in as large an area as‘possible in front of a pair 
of loudspeakers, the same vector sound pressures as he would 
have experienced by direct listening in a corresponding position 
in front of the sound stage. In other words, although the spacing 
between the loudspeakers must, in general, be less than the 
width of the original sound source (except for some solo instru- 
ments and unaccompanied solo voices), if the apparent angular 
width of the reproduced sound is the same as that subtended at 
the optimum position for direct listening, the listener will imagine 
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that he has been-conveyed to the optimum position for direct 
listening in the recording studio. So far as is known, the ratio 
of reverberant to direct sound provides the only clue to distance 
of the source, except for any possible change in quality due to 
absorption of high frequencies at a distance. In general, the 
reverberation of the average domestic room is small compared 
with that of concert halls used for large musical combinations, 
and so the sense of distance thus conveyed is not substantially 
altered. 

It is impossible to achieve this aim with perfection, but an 
attempt is made in the system to reproduce the actual relative 
phases and thus the inter-aural time differences at the lower 
frequencies and the relative intensities at the upper frequencies. 

It should also be clearly understood that the system is intended 
to operate only with spaced loudspeakers and thus transmit 
the spatial effect when listening in free space. The system will 
not function correctly if the two channels are connected to left 
and right headphones. 


(4.3) Mathematical Theory 


(4.3.1) Vector Pressures at Ears in Direct Listening. 


Fig. 3 shows an actual source of sound S before two ears 
E, and Ep, spaced by a distance A in such a position that its 
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Fig. 3.—Time differential of sound at two ears from 
oblique source. 


direction is at an angle @ to the face-on position. The sound to 
Eg, however, will travel a distance EpA,p further than, and that 
to E, a distance E,; A, less than, the average. If v is the velocity 
of sound in air, the sound will take a time (A sin 0)/2v to travel 
from E,; to A;. Thus the time interval between the arrivals of 
sound at the two ears will be (A sin 0)/v. Hence, if A is small 
compared with the distance from the source, the magnitude at 
each ear will be the same, but there will be a phase difference 


game 


v 
where w is 27 times the frequency of the sound wave. 

At higher frequencies, when the wavelength is short compared 
with A, the phase angle will be large and ambiguous but the time 
delay will be the same. Owing to the masking effect of the 
head, the magnitude is not subject to accurate calculation but 
has been determined experimentally.!?: '3 

If these phase differences and magnitudes can be simulated in 
reproduction, the received sound will appear to come from the 
same angle 6. 


(4.3.2) Reproducing System. 


For domestic purposes it is generally permissible to use two 
loudspeakers only, and they must be operated in such a way as 
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to reproduce as nearly as possible the required vector pressures 
at the ears. Since the bulk of acoustic energy in music and 
speech occurs below about 700c/s, it is important that the 
system should operate effectively at low frequencies. 

In Fig. 4, Wy and We represent two similar loudspeakers 


Fig. 4.—Apparent source deduced from the time differential of sound 
produced at two cars by two loudspeakers operating at different 
sound levels. 


supplied with inputs which are in phase but of different mag- 
nitudes. The loudspeakers subtend an angle 2: at a centrally 
placed listener whose ears E, and Ep are separated by a distance 
h, which is assumed to be small compared with the loudspeaker 
‘distance. From eqn. (1) the phase difference between E, and 
E, for sound coming from either speaker will be 2wp = 
(wh sin y)/v, where 2 is the difference in time of arrival at 
E, and Eg. 

Let the average instantaneous sound pressure at both ears 
be Lsin wt from W, and Rsinwt from Wp. The sound 
pressures at each ear can be calculated as follows: 

At Ey At Eg 
Lsin oft +p) Lsinolt — p) 
Rsinol(t — pw) Rsin w(t + pw) 


Total pressure at E, = L sin w(t + pw) + Rsin w(t — p) 


Average 
Lsin wt 
Rsin wt 


From Wy, 
From Wr 


. 
=4/(L?+R?+2LR cos 2wp) sin [wr + arc tan G = tan om) 


Similarly, the total pressure at Ep is. 


L—R E 
24 p2 no . 
/(L? +R? +2LR cos 2) sin E are tan (5 a tan on) | 


Hence the phase difference between E; and Ep is $5, 


where ¢. = 2 are tan (74 tan wy) 


a L—R_ owhsing 
= 2are tan (+ pan) 


wp and d, < 2/2 


L — R whsin ds 
== ae ng sy (2) 


v 


Thus, if the loudspeakers are supplied in phase with correct 
relative amplitudes, the phase difference ¢, at the ears can be 
made to be the same as that from a sound source at any angle 
within +. In addition, if L or R is permitted to become 
negative, i.e. to have a phase reversal, the apparent direction 
lies outside the limit of +1 at frequencies where this analysis 
is valid. This phenomenon is not readily observable in practice 
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possibly because it is accompanied by a certain degree of ampli- 
tude cancellation. It has, however, been observed under 
laboratory conditions.* 


(4.3.3) Microphone Systems. 

When Blumlein’s first experiments were carried out in 1929-30 
the only readily available microphones were pressure-operated 
and had substantially circular polar diagrams. In order to 
obtain the required inputs to the loudspeakers the first arrange- 
ment to be used consisted of two such pressure microphones 
separated by about 8 in (i.e. typical distance between ears). 
The outputs were therefore the same in magnitude, for any 
source angle, but there was a phase difference given by eqn. (1). 
To convert this into the required amplitude difference demanded 
by eqn. (2), an ingenious circuit was used in which the two 
microphone outputs were first summed and differenced to pro- 
duce two new voltages in quadrature. The difference voltage 
was integrated, i.e. multiplied by a factor proportional to 1/w 
and rotated by 90°. The sum voltage was uniformly attenuated 
by a suitable amount. These two modified voltages were again 
summed and differenced, giving two final voltages, in phase 
but of different amplitudes, which it can be shown are of such 
values that, when applied to two loudspeakers, will, according 
to eqn. (2), produce at the ears the same phase angles as those 
at the microphones. 


Fig. 5.—Polar characteristics of a pair of velocity microphones at 90°, 


Fig. 5 represents two velocity microphones, e.g. ribbon type, 
with their axes of maximum response at 90°. Such microphones 
have a response which is proportional to the sine of the angle 
between the source and the plane of the ribbon. A source S, is 
assumed to be at an angle @, from the median axis of the micro- 
phones. The outputs from the microphones will be in phase at 
all values of @, if the microphones are placed on the same 
vertical axis but their outputs L and R are given by 


L x sin (45° + 8,) 
R & sin (45° — 6,) 


whence it can be shown that 
« @ 


(4.3.4) Performance of Complete System at Low Frequencies. 


If the outputs of the microphones are connected (after suitable 
amplification) to the loudspeakers, the resultant phase difference 
between the sounds at the ears will be 


$ _L—Rohsing 
2"L+R 0 
@. 


= tan 0, wee sin 


from eqn. (3) 


: 
| | a 
: me e Se 2 YJ 

; 
\V/ | 

\ 4A | 

Ee. /\. &r i 

: 
Pe : 
s : 

cy J 
GSK 
a 
te Hf om 
es /. 

a . 

ee 

ee . 

pe 
When PF z a 
ee a 
5, se ae 
— 


ae 
Las 


‘a 


ENN EB ge Gora 


108 


The ears will interpret this as an apparent sound source at an 
angle 0,, where sin 8, = ¢2v/wh, from eqn. (1); 


i.e. sin@,=tan@,sing . . . . . @) 


Fig. 6 shows the apparent angle plotted against the true angle 
for various values of ys. It will be seen that, when the listener 
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Fig. 6.—Apparent angle plotted against true angle for velocity 
microphones at 90°. 


is at such a distance that the loudspeakers subtend an angle of 
120°, the apparent angle is very close to the true angle up to 
+35°. When the listener is at other distances from the loud- 
speakers, but still on the centre line, the apparent source remains 
at the same fraction of the total loudspeaker spacing, thus 
showing that a correctly proportioned sound picture is presented 
although the angular scale has been altered. The angular 
distortion is such that the apparent source appears to be some- 
what nearer the centre than it should be over most of the range. 


(4.3.5) Performance at High Frequencies. 


If the velocity microphones have a constant polar charac- 
teristic at all frequencies, the ratio of voltages applied to the 
loudspeakers is independent of frequency. At frequencies above 
about 700c/s, however, the foregoing analysis will not hold, 
since the phase angle between the pressures at E, and Eg will be 
ambiguous. At these frequencies the masking effect of the head, 
however, causes the left ear to be more affected by the left 
loudspeaker than by that on the right and vice versa, and it can 
be easily demonstrated experimentally that directional informa- 
tion can be conveyed at the higher frequencies. 

Subjective tests were made with a number of observers using 
two loudspeakers supplied with known relative voltages from a 
source of recorded music. First a filter was inserted passing 
all frequencies up to 700c/s from a variety of sources of sound 
including male and female speech, solo, orchestral and brass- 
band music. The experiment was repeated using all frequencies 
above 600c/s. Quite definite location within about +2° was 
obtained in each case, but whereas at low frequencies the angle 
was in agreement with that predicted from eqns. (1) and (2) 
for a given loudspeaker ratio, that obtained at high frequencies 
was greater. The relationship obtained is in agreement with 
that published by other workers who rely primarily on intensity 
differences.'5 By introducing a factor of approximately 0-7 
into the ratio (L — R)/(L + R) above 700c/s, the results for 
high and low frequencies can be brought into line, except for 
extreme positions of the source. 


(4.4) Basis of Recording System 
To meet the requirements set out in Section 4.2, therefore, 
the microphone system can consist of a pair of velocity micro- 
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phones with mutually perpendicular axes of maximum response. 
The outputs from these microphones can be summed and 
differenced and the difference channel subjected to a loss of 3dB 
at all frequencies'* above 700c/s. These voltages are then 
again summed and differenced and used to supply two identical 
loudspeaker channels. If the loudspeakers are equidistant from 
the observer and subtend an angle of approximately 90° at him, 
he will receive substantially correct directional information at 
all frequencies within the spectrum. At greater distances from 
the loudspeakers the listener will observe the same apparent 
position of the source, relative to the loudspeakers. 

An important feature of a system using two coincident velocity 
microphones at 90° is that the r.m.s. sum of the outputs of the 
loudspeakers is constant for a source at a constant distance 
from the microphones, regardless of its direction. It is this 
feature which enables the system to reproduce a uniform sound 
field between the loudspeakers without the tendency to leave a 
‘hole’ in the centre. This latter effect can be very pronounced 
with the spaced microphone system. 


(4.5) Sources of Error 

Although the two amplifying channels and the microphones 
and loudspeakers have been assumed to be distortionless in the 
above explanation, it is clear that they cannot be perfectly matched 
in amplitude and phase, particularly the transducers. The 
effect of the small inevitable differences in frequency response 
can be calculated from the preceding theory, but such transducers 
will, in general, introduce phase shifts before amplitude differences 
become very pronounced. 

Calculations show that a phase shift in one channel produces 
no change in the central position, i.e. when the two channels are 
at the same level. A phase shift in one channel of as much as 
90° can introduce error factors of 2 at small values of 6, 1-5 
at 0, = 26° and 1-2 at 0, = 39°. 


(4.5.1) Effect of Asymmetric Listening Position. 

The preceding calculations have assumed an observer sym- 
metrically placed with regard to the loudspeakers and facing the 
centre line. If the observer turns his head the apparent source 
will appear to remain in the same position, as in natural listening, 
but if he moves to either side of the centre line the apparent 
position wil! move in the same direction, e.g. from S, to S; in 
Fig. 7. 


a” Sq Sa x Wr 
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Fig. 7.—Change of position of apparent source with displaced listening 
position. 


The reasons for this are, first, that the loudness from Wp 
has increased and that from W, decreased, and their relative 
phases are altered owing to the difference in the path lengths. 
Secondly, the angles subtended by the loudspeakers at the 
observer have also become unequal, The effect of these 
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inequalities makes the calculation of apparent angle exceedingly 
laborious, but it is possible, by making simplifying assumptions, 
to appreciate their general effect. Fig. 8 shows the position 
of the apparent source of Fig. 7 for various listening positions 
plotted against the positions of the real source with respect to 
the microphones. In Fig. 8, r is the projection of the observer’s 
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Fig. 8.—Apparent position plotted against true position for asym- 
metric listening position. 


displacement on the loudspeaker base-line, expressed as a 
fraction of x. The calculations ignore the effect of the path 
difference on the relative phases at the listener of the sound 
from W, and Wk; i.e. it applies where such a path difference is 
smali compared with a wavelength. The curves bear out the 
statement that the apparent source moves in the direction of the 
observer. 

In order to take account of the effect of path differences it is 
necessary to fix the scale of the layout shown in Fig. 7, so that 
a definite phase shift can be associated with any given listening 
position at a particular frequency. For this purpose let d equal 
10ft to represent a typical domestic arrangement. A listening 
position is chosen corresponding to r = 0-5, and the apparent 
position is plotted against frequency in Fig. 9 for various values of 
the true angle @,. It will be seen that, as the frequency increases 
from zero, deviations from Fig. 8 are initially small, but they 
increase rapidly above 100c/s, reaching a maximum in the region 
of 200c/s, beyond which the function approaches its initial value 
once more before reaching a second maximum at about 600c/s 
and again returning to its original value. 

The calculations have not been made for frequencies above 
which the theory given in Section 4.3.2 is valid. 

The function may reach a finite maximum, which in all cases 
is seen to lie outside the loudspeaker base-line, or it may become 
imaginary. In this case, the phase difference at the ears is so 
large that it cannot be interpreted in terms of any real angle of 
arrival. These regions are the ones in which the path difference 
approximates to an odd number of half wavelengths, the sound 
from the two loudspeakers arriving substantially out of phase 
at the two ears. 
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Fig. 9.—Apparent position as a function of frequency for asymmetric 
listening position. 
(a) & = 36-9°. 


4. 
(/) & = —36-9°. 


Fortunately, at these critical frequencies the resultant mag- 
nitude becomes small, thus allowing the remaining sound, 
which has small phase errors, to predominate. The importance 
of scale is obvious from the curves; halving all dimensions will 
allow the system to operate to twice the frequency before the 
anomalous regions are approached. 

The effect of the offset listening position is worst for true 
angles near the centre, when L and R have about the same 
amplitude. (The extreme positions, 8, = + 45°, are unaffected, 
since oniy one loudspeaker is operating.) 


(5) RECORDING EQUIPMENT 
(5.1) Microphone Combinations 
(5.1.1) Pressure Microphones. 

As stated in Section 4.3.3, the earliest experiments were 
necessarily carried out with pressure microphones. These had 
substantially spherical polar diagrams—at least, at low fre- 
quencies. The problem of producing magnitude differences 
was solved by giving them a small but definite spacing, summing 
and differencing their outputs, and integrating the difference 
vector before recombining to drive the loudspeakers. The 
significant parameter in such a case is the actual spacing employed. 
If this exceeds about a quarter wavelength of the highest working 
frequency, the resultant outputs to the loudspeakers become of 
opposite polarity and the system breaks down. Small spacing 
is desirable on this score. However, if the spacing is too small, 
the difference vector will diminish until it is comparable with 
the normal random differences in the microphones. Moreover, 
the amplification in the difference channel has to be increased 
to compensate, in addition to having a characteristic inversely 
proportional to frequency as already mentioned. Low-frequency 
electrical noise introduced by the amplifiers thus becomes the 
factor limiting the reduction in microphone spacing. A way out 
of this difficulty was sought by using more than one pair of 
microphones to cover the required frequency band, but, as can 
be imagined, this arrangement was cumbersome in practice and 
complicated theoretically. Pressure microphones were finally 
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abandoned after a composite microphone system had been tried, 
using two pairs of crystal elements, one pair being spaced 8cm 
apart and operating up to 1000c/s and the other being spaced 
lcm apart and operating from 1 000c/s upwards. 


(5.1.2) Velocity Microphones. 

By this time, ribbon microphones were becoming available. 
As is well known, these have a cosine-law polar characteristic 
in the horizontal plane. A pair of these can be used in two ways 
to produce the required outputs for driving a pair of loudspeakers. 
For example, they may be used with one microphone having its 
axis directed to the centre of the sound stage and the other as 
close as possible but with its axis at right angles. In this con- 
dition the outputs from the microphones will be in phase and 
invariant with frequency (in so far as their characteristics are 
flat), and will correspond to the sum and difference vectors 
described in Section 5.1.1 after manipulation but before recom- 
bination to drive the loudspeakers. 

Alternatively, they may have their axes equally inelined to 
the centre of the sound stage. In this condition their outputs 
are suitable for direct amplification and reproduction by a pair 
of spaced loudspeakers. Such outputs may be considered to 
be of ‘left and right’ type rather than ‘sum and difference’, as 
in the previous instance. : 

In either case it will be noted that the operation of integration 
that had to be per/ormed in the case of pressure microphones is 
unnecessary. Consequently, although ribbon microphones are 
generally less sensitive than pressure types, the overall signal/noise 
ratio of the system is improved. 


(5.1.3) Other Types of Microphones. 

It will be apparent that any pair of microphones having a 
polar characteristic other than circular can be used in this way 
to produce ‘left and right’ type signals of different amplitudes 
at a pair of loudspeakers. Whether these combinations are of 
value depends on the uniformity with which their polar charac- 


teristics can be maintained with frequency, as well as the degree - 


within which the microphones can be matched in amplitude and 
phase. 

Combinations of microphones having dissimilar polar charac- 
teristics may also be employed to produce ‘sum and difference’ 
type outputs, provided that their amplitude and phase charac- 
teristics are well matched. Combinations of polar diagrams 
which appear to show promise are those of circular with cosine, 
which would give the same directional response as the pressure 
microphones used in the early experiments, and cardioid with 
cosine, which would give substantially a one-sided version of the 
same thing. 


(5.1.4) Current Practice. 


The combination of two cosine microphones has so much to 
recommend it compared with other known arrangements that 
the principal effort has been concentrated on its improvement. 
If a pair of ribbon microphones is used, or even a specially- 
designed double-element ribbon type, the difficulties encountered 
are still quite serious. At low frequencies the fundamental 
resonances of the two ribbons cause relative phase shifts unless 
these are adjusted to occur at thé same frequency and to have 
the same degree of damping, whilst at high frequencies the 
ribbon microphone having the theoretical cosine-law polar 
diagram has yet to be designed. Departure from constancy of 
polar diagram with frequency will upset the requirement that 
the amplitude differences for any given angle of arrival shall be 
independent of frequency. Nevertheless, with all their limita- 
tions, some very acceptable recordings have been made using 
ribbon types. 


Within recent years the condenser microphone has undergone 
considerable development, enabling directional characteristics to 
be obtained from a double-diaphragm element of small size. 
Since these elements are substantially free from mechanical 
resonances within the audible spectrum, their amplitude and 
phase characteristics show great uniformity. This is fortunate 
as the polar characteristics show some variation from sample 
to sample, particularly at low frequencies. Selection has thus 
to be principally on the basis of polar response. Individual 
adjustments to match their sensitivities can easily be made. 
Pairs of these microphones, mounted with their elements as 
close together as possible in a single cylindrical holder, have 
been used with some success. Like all arrangements with one 
element above the other it is necessary to ensure that the common 
axis is perpendicular to the plane in which the various sounds 
lie, otherwise the vector relationships are upset. 


(5.2) Studio Techniques 


The problems encountered in making a ‘stereosonic’ record are 
not all of an engineering nature. The finished record must 
have a pleasing tonal balance, and the apparent spatial distri- 
bution of the instruments, if not of prime importance, must not 
sound unnatural. Using a single pair of microphones, it is 
often extremely difficult to satisfy the required conditions. In 
single-channel recording, good results have sometimes been 
achieved using only one microphone, but the occasions when 
this is possible are the exception rather than the rule. Generally, 
in order to achieve a proper balance, multi-microphone tech- 
niques have to be adopted. For obvious reasons this cannot be 
done when the direction of the various sources is significant. 
It is therefore necessary to rely on correct disposition of the 
performers in the recording studio to achieve the required tonal 
balance. In this connection, it may be remarked that the double- 
cosine combination has two working arcs each 90° wide on 
opposite sides of the common axis. This has been found useful 
in practice, particularly for large choral works, in which the 
orchestra has been arranged on the one side, with choir and 
soloists on the other. The:adoption of such unusual layouts, 
however, often makes the task of the conductor more difficult, 
since he may not be able to see all the performers without 
turning round. Considerations of this sort, which may be 
termed the technicalities of musical performance, often preclude 
the use of the purely engineering solution and make the task of 
the recording engineer even more complicated. 

Under difficult circumstances it is possible to envisage the 
need for more than one microphone pair, but it is realized that 
the use of additional microphones may introduce as many 
problems as it solves. 


(5.3) Microphone Amplifiers and Equalizers 

Amplifier design follows conventional lines. It is, of course, 
essential that pairs of amplifiers used in this type of recording 
have accurately matched frequency and phase characteristics, 
and that their gain be constant over long periods. In general, 
however, modern designs employing negative feedback have 
no difficulty in achieving the required consistency. The same 
requirement for accurate matching applies to any microphone 
equalizers that are used, e.g. for bass correction when working 
in close proximity to an artist. 


(5.4) Mixers 


As stated in Section 5.2, the use of a single microphone pair 
for stereophonic recording is almost mandatory. It is neverthe- 
less possible to foresee circumstances in which it may be necessary 
to use a second pair or to introduce extraneous effects, 
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Some form of mixer is therefore necessary, although this will 
generally be a simple device compared with those used in con- 
ventional recording, which may have to cater for relatively large 
numbers of microphone channels. 

A stereophonic mixer in current use does permit the use of 
two microphone pairs with independent level controls. It also 
includes means for providing the correction at high frequencies 
referred to in Section 4.4. This requires that the signals, if not 
of the sum-and-difference type, shall have been subjected to a 
sum-and-difference operation. 

A further facility is provided which gives an additional degree 
of flexibility in the recording studios. This consists of a 
differential control permitting the relative amplification of the 
‘sum and difference’ channels to be altered over a limited range. 
This control allows the apparent angle of the total reproduced 
sound to be modified and can be used to good effect in several 
ways. The microphones may have to be withdrawn from the 
artists in order to get more ‘ambience’ into the recording. This 
will reduce the apparent stage width, but the full width can be 
restored by suitably attenuating the sum channel relative to the 
difference. Similarly, if the need for great ‘presence’ calls for a 
very close microphone position, the reproduction may cause a 
solo instrument to sound much too large, and this can be corrected 
by attenuating the difference channel relative to the sum. 

The signals are finally subjected to a second sum-and-difference 
operation to convert them to ‘left and right’ type for application 
to the input of a twin-track tape recorder. The replay amplifiers 
of the machine drive two matched loudspeakers suitably adapted 
for stereophonic monitoring. 

A simplified diagram showing the operations performed in a 
typical recording channel is given in Fig. 10. 


POLAR 
CHARACTERISTICS 


Fig. 10.—Functional schematic of recording system. 


An interesting feature of the control system is the peak-level 
indicator. The use of two such meters, one on each channel, is 
unsatisfactory owing to the difficulty of observing them simul- 
taneously. In this equipment a peak-level indicator circuit,!® with 
the ability to measure the true level of short transients, is con- 
nected to each recording channel, and the meter on the mixer 
panel can be switched to either channel at will. Alternatively, 
for general use, a circuit is provided whereby the meter is elec- 
tronically switched to whichever channel has the higher level. 
The operator then uses his main gain control in the ordinary 
way to avoid overloading either channel on the tape. 


(5.5) Magnetic-Tape Recording Machines 
In normal record production it is the practice, irrespective of 
the final form of the commercial article, to make master record- 
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ings on magnetic tape for convenience of play-back, editing, etc. 
These advantages apply equally to stereophonic recording. The 
system described employs professional machines of high quality, 
modified by the addition of twin-track heads and twin-head 
amplifier channels. 

A standard tape speed of 15in/sec has been chosen for two 
reasons: 

(a) Satisfactory standard of master quality can be obtained at 
this speed y 


(6) The majority of single musical works car: be recorded within 
the full 11 in-diameter tape spools and thus simplify the copying 
process. 

The record heads are provided with bias from a common 
oscillator to avoid heterodyne beats due to residual crosstalk. 


(6) RECORDING AND REPRODUCING METHODS 


(6.1) Choice of Media 


It is essential that the recording of the two channels should 
be made on a medium which allows the two tracks to be per- 
manently linked together so that the correct phase relation 
between the signals is maintained throughout any copying 
process and replay operation. 

In 1931 when Blumlein conceived the basic idea of his system, 
the only recording medium capable of giving reasonable results 
was the wax-cut disc with its electrolytic copying process. He 
therefore proposed applying the two signals from the microphone 
system to the complex transducer so arranged as to cut a single 
groove. If the axes of movement of the transducer armatures 
carrying left and right signals are inclined at 45° to the surface 
of the wax and are thus at 90° to each other, the resulting lateral 
cut can be arranged to represent the sum of the two channels 
and the ‘hill and dale’ to represent the difference. Such a 
disc could be played on a normal gramophone to give the 
equivalent reproduction of a single-channel recording. The 
early experimental samples of the complex-cut disc suffered 
from considerable interaction at the higher frequencies and from 
excessive background noise. The fine-groove technique may 
simplify the problem, but to produce a complex-cut disc to the 
standards demanded by the modern record is very difficult and 
the problem has not yet been solved satisfactorily. 

In the United States stereophonic disc records have been 
marketed in which the lateral-cut tracks are disposed in two 
concentric bands. This record is replayed with two pick-up 
heads mounted on a common arm. The operation of locating 
both pick-up styli in their correct grooves is a delicate one. 
The frequency response and distortion characteristic of the two 
tracks will differ quite appreciably when there is a large difference 
between the track diameters. 

There have been various suggestions for modifying the fre- 
quency band of the signals of one of the channels, so that both 
channels can be recorded on one groove by means of a single 
transducer. One such method proposed by Livy!” suggests that 
one set of signals is arranged to modulate a carrier, then to 
select the lower sideband and apply this, together with the normal 
signal band of the other channel, to a wide-range cutter head. 
Reproduction is effected by using a wide-range pick-up, separating 
the two frequency bands and applying the upper band to modu- 
late a carrier of the same frequency as that used for recording. 
The lower sideband produced by this modulator contains the 
same frequency components as the original signal. Livy also 
proposed to record the carrier frequency on the disc so that it 
would be used during reproducing to control the frequency of 
the local oscillator. This double-bandwidth system demands a 
very high performance in terms of frequency response, and thus 
if a frequency range up to 10kc/s be required, the full bandwidth 
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of the system will be at least 20kc/s. This wide bandwidth is 
very difficult to attain, particularly at the inner groove diameters. 

If recordings are made side by side on a continuous film, the 
synchronization problems are largely overcome, and while it 
is recognized that optical methods have been used, the recent 
development of the magnetic-tape process makes it a more 
attractive system. 


(6.1.1) Crosstalk. 


The overall crosstalk between channels should be better than 
30dB at all frequencies. In the production of a commercial 
tape record there are at least four stages where crosstalk can 
take place: 

Master recording. 

Master replay. 

Copy recording, commercial tape record. 
Copy replay. 

If copy masters are used two more stages of crosstalk are 
introduced. In practice, it has been found that the same amount 
of crosstalk is introduced during the recording process as during 
the reproducing process. 

In well-designed heads a crosstalk of —50dB can be attained, 
and if the crosstalk at the various copying stages can be added 
arithmetically the overall figure for a commercial tape record 
will be. —38 dB, or if a copy master is used, —34dB. If it is 
required to replay either twin-track or single-channel half-track 
tape records from the same head system, the crosstalk of this 
replay head must be better than —55dB, and preferably better 
than —60dB, in the 1 000-3 000c/s region. 

Crosstalk can take place owing to the mutual inductance of 
the heads and owing to leakage fields from one track to the 
head on the other track. The mutual inductance can be reduced 
by placing magnetic screens between the heads and separated 
from them by brass or aluminium spacers. These screens should 
be large enough to shield the magnetic circuit and the windings. 
The usual back gap should be eliminated, and the head dimension 
should be kept as small as possible. 

A useful reduction of crosstalk can be achieved by either 
series or parallel cross-connection of the windings, so that a 
small signal from one head is injected in anti-phase to the leakage 
signal on the other head. If carefully adjusted the series cross- 
connection can reduce the residual crosstalk by at least 10 dB. 

Appreciable crosstalk may occur if the front edge of the outer 
magnetic screening shield is too close to the magnetic tape. The 
flux from-.one track enters the shield and then passes to the 
magnetic yoke of the other head operating on the other track. 
A clearance between the tape and the shield of 0-2in is sufficient 
to avoid this effect. 


(6.2) Track Standards 


The early experimental stereophonic magnetic-tape recordings 
were made with the heads displaced by 234in. This enabled 
the conventional half-track heads to be-used on the existing 
recording and replay machines, and the separation between the 
heads was sufficient to make crosstalk negligible. When copy 
tapes were made for issue as commercial records it was soon 
obvious that the use of in-line heads would make for easier 
acceptance as an international standard for domestic and com- 
mercial tape records. It was also decided to change to in-line 
heads on the master recordings, so that the process of editing 
was simplified. Furthermore, the use of in-line heads eliminates 
phase shift between the signals on the two tracks owing to small 
variations of elasticity of the tape along its length. 

Standardization has been effected on the designation of the 
tracks, which is as follows: 


If the tape moves from left to right and with the active side facing 
away from the observer the top track shall be designated No. 1 


‘as outlined in Section 4.5.1. 
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track and shall carry the recording for the left-hand channel as 
viewed from the audience. The bottom track shall be designated 
No..2 track and shall carry the recording of the right-hand channel. 


The replay response characteristic (100microsec) and the 
track dimensions are in accordance with Amendment No. 1 to 


B.S. 1568: 1953, relating to tape speeds of 74in/sec. The track 
dimensions are as shown in Fig. 1i. 
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‘Fig. 11.—Track standards. 


(a) Record-head track. 
(6) Replay-head track. 
Views looking on coated side of tape. 


(6.3) General Requirements for Domestic Operation 

The tape transport mechanism may be of the conventional 
type, but the associated twin-track magnetic replay head must 
conform to the standards laid down above. Since the signals 
from the two tracks on the tape are left and right, it is essential 
that the gains of the two replay channels should be closely 
matched, and for this purpose preset controls should be provided 
for initial adjustment. It is a further requirement that the 
equality of gain should be maintained for all settings of the._main 
coupled gain control. Large relative phase shifts in the two 
channels should be avoided, particularly at low frequencies. 

The underlying principle of the system demands that the loud- 
speakers should possess, as far as possible, uniform polar response 
characteristics in the horizontal plane at all frequencies. It has. 
been suggested that departure from uniformity can be an 
advantage in maintaining the apparent position of the source 
for a wide range of listening positions. Such non-uniform polar 
characteristics cannot reasonably be attained except at high 
frequencies. Even if it were possible to extend the directional 
characteristics down to the low frequencies, the beneficial results 
claimed would not be realized on account of the path differences, 
At the high frequencies non- 
uniform polar characteristics have the disadvantage that the 
overall tonal balance will vary with the listener’s position. 
Furthermore, the directional response will emphasize any back- 
ground hiss and will tend to identify the two loudspeakers as 
separate sound sources and thus’ interfere with the illusion. In 
addition, the domestic user might have difficulty in positioning 
the loudspeakers with sufficient accuracy. . 

If cone loudspeakers are used the rear radiation should be 
suppressed, and this can be conveniently done by using a closed 
box baffle. Such baffles can have quite small volume, and they 
can be designed to occupy a very small floor area. Any loss of 
bass response due to the small enclosed volume can be com- 
pensated electrically in the power amplifiers. 


(6.3.1) A Commercial Model. 

One form of domestic reproducer has been described by 
Smith and Martin,'* but for the sake of completeness a brief 
general description may be of interest. The complete machine 
consists of two consoles each containing a loudspeaker group 
and a power amplifier. One of these consoles also contains a 
tape deck with its associated head amplifiers, tone and gain 
controls. Fig. 12 is a schematic. Bass and treble ganged tone 
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Fig. 12.—Functional schematic of reproducer. 


controls are provided, in order to allow some adjustment for the 
different acoustic conditions which may be met. 

In order to maintain uniformity between the two channels the 
controls are of the stepped-switch type. The main gain control 


consists of two ganged switches with 12 steps of 3dB each.. 


There is a differential gain control on the panel to take up any 
slight drift in amplifier gains and to allow for asymmetry in the 
listening room. A preset continuous control on the amplifier 
chassis balances the gain during factory adjustment. The power 
amplifiers are rated at a peak output level of 10 watts. 

In order to give the best horizontal polar distribution the 
elliptical cone loudspeakers are mounted in closed rigid box 
baffles of 34 ft? capacity, the major axis of the cone ellipse being 
vertical. The frequency range of this speaker is limited at the 
upper end to about 5000c/s, in which region the electrostatic 
speaker takes over and continues beyond 15kc/s. The electro- 
static speaker consists of a curved metal back plate 24in long 
by 14in wide, over which is laid a membrane metallized on the 
side away from the back plate. This speaker is driven from a 
separate amplifier and is supplied with a bias potential. of 
300 volts. 


(6.3.2) Domestic Listening Conditions. 

In general, ‘stereosonic’ tape records are balanced with regard 
to tonal quality and perspective, for reproduction in the average 
home lounge in which the reverberation time is of the order of 
0-Ssec or less. This reverberation period is short compared 
with that in the studio or concert hall in which the record- 
ings will have been made; it is not likely to interfere with the 
listener’s impression of the original studio conveyed to him by 
the stereophonic effect; and there will be no confusion due to 
the two distinct reverberations which might blur the detail. 
Provided that the walls and the floor are reasonably absorbent 
it is found that, in spite of errors due to asymmetry, one can 
move about over a large floor area in front of the loudspeakers 
without losing the major benefits of this type of reproduction. 
As stated earlier in the paper, the optimum position for listening 
is at the apex of an equilateral triangle with the base-line formed 
by the line joining the two loudspeakers; this is the arrangement 
used for monitoring during recording. In the average room it is 
satisfactory to set the loudspeakers 10 or 12ft apart, but in a 
restricted space a very fair performance can be obtained down 
to distances of as little as 4 or Sft. 


(6.4) Operation of the System in Large Halls 
Special difficulties attend the reproduction of ‘stereosonic’ 
records on a large scale, apart from the general fact that, like 
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all other types of record, they are balanced primarily for domestic 
conditions. 

The most serious errors are caused by increased path 
differences. These occur, as outlined in Section 4.5, down to 
lower frequencies and over a much greater proportion of the 
listening area than is the case when operating at the scale for 
which the system was designed. 

In a small room reflections from the walls, etc., decay rapidly 
and follow each other at such short intervals that the resulting 
sounds appear to the listener to coalesce. In large. halls the 
relatively undamped primary reflections from the surroundings 
arrive at the listeners’ ears at intervals sufficiently great to cause 
an appreciable distortion of the sound picture. 

Ideally the system requires that the loudspeakers shall be 
uniformly radiating point sources. In the domestic case a 
reasonable approximation to this can be achieved because single 
small units can be constructed to handle the required power. 
In large-scale reproduction, multiple units or single units of 
appreciable size are necessary; this departure from the ideal 
can cause a depreciation of the stereophonic definition. 
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(9) APPENDIX 


(9.1) Change of Position of Apparent Source under Asymmetric 
Listening Conditions 
In the asymmetric case there are three principal reasons for 
departure from the simple theory applicable to central listening. 
These are as follows: 


(a) Operation of the inverse-square law to alter the magnitudes of 
sound pressures from the loudspeakers at the listener. 
(6) The phase difference due to the unequal path lengths from the 
loudspeakers to the listener. 
0 The unequal angles subtended by the loudspeakers at the 
ener. 


The method of calculation of apparent position is shown 
below. Fig. 13 shows diagrammatically a source S on a sound 
stage PQ, towards which a microphone pair M is directed. The 
microphone outputs are reproduced on a similar stage by a pair 


857) 
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Fig. 13.—Position of apparent source in the asymmetric case. 


of loudspeakers, W,, Wp, assumed to have uniform polar charac- 
teristics over the frequency range considered. The virtual image 
S, thus produced is observed by a listener O displaced from the 
central position C. 

Then let 


2x = Distance between loudspeakers W, and Wr. 
d = Distance between listener O and base-line. 
rx = Displacement of listener parallel to base-line. 
Yq = Fractional distance of apparent source S from 
central position: 


L? sin 2wpy 


R? sin 2wjtp 
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y, = Fractional distance of real source from central 
position. ' 
L, R = Magnitudes of sound pressures due to W, and Wz 
at C. 
B, y = Angles subtended at listener by W, and We. 
7 = Difference in time of arrival of sounds from W, 
and Wg at O due to path difference. 
4 eR = Time differences at ears E, and Eg relative to O 
of sounds from W, and We. 
¢z, dz = Resultant phase relative to O of sound at left and 
right ears. 
6, = Apparent angle of arrival of sound at O. 
6, = True angle of source with respect to microphones 
for magnitudes L and R from the loudspeakers. 


‘Then: 


OW, = ay/[(1 + 1}? tan? f + 1] 
OWg = dy [(1 — r)? tan? ¢ + 1] 


OW, — OWp 
v 


T= 


= “fd +r) tan? +1] —V[d —rP tan? 41} 


Relative pressures at the listener’s ears are as follows: 


At E, At Eg 
Lsin w(t — tT + py) Lsin w (t — tT — py) 
V{(d +r? tan2f +1] V[ +r? tan? + 1) 


From W, 


Rsin w(t — pp) Rsin w(t + pp) 
From Wr a —retany +1] Vid —nPtany +1] 


The resultant phases at the ears relative to O, derived from the 
above, are: 


Lsino(p,—7) Rsin wpe 
V[(d +r tan2gti]) = V[—rP tan?y +1) 
Loos w(y, — T) 4 R COS WiLp 
V[(A +r tan? g +1) V[G — 1) tan? yf + 1) 


$, = arc tan 


Lsinw(up+7) Rsin wip 
V[(d+rPtan2y+1) [0 —rp tan? + 1)U 
Leos w(z +7) 4 ROS Wit 
V[U +r) tan? fp +1] V[(i—r)? tan? f +1), 


op = — arc tan 


whence it can be shown that the total resultant phase difference 
at the ears is: 


LR2 sin w(prz, — fp) COS WT 


(i+rPtan2y+1 G@—rPtan?yZ+1 


+ Ya + rp tan? yp + IJv[d —rP tan? f+ 1 


(tz — $n) = arc tan ) 5 —— — 


R: cos 2wtR 


LR2 cos w(fyp — hp) COS WT 


(i +r) tan? +1 


(1 —r) tan? yf + 1 


V[G +r)? tan? f + 1}V[G —r} tan? yf + 1} 


——| 
Se et 
) ALAN. —— 
\ J — ee | 
| \4 _—— 
—_— ee 
_ -_ I 
a ae = | 


THE ‘STEREOSONIC’ RECORDING AND REPRODUCING SYSTEM 


D?dwpy 


R?2wpp 


115 
ar aking the approximation, on which the system is based, that wp, and wpe are small, thus implying also that $, and $x 


+ LR2u(j, — pg) COS wT 


($x — px) > tan ($, — $x) > 


(i+rftantZ+1 ( —r? tant fy +i 
“Rz 


Vii + r? tan? fb + 1}/[(l — r? tan? yf + 1) 


L4 


R LR2 cos wt 


C+ Pang +i  G—yPan gti’ Yd tangy tify —y tan yp $1] 
wh sin 0, 
(¢: — $x) = ———* 


Therefore 
Substituting for 4, and jz in the expression for (¢, — ¢,): 


(L/R) sin B (RIL) sin y 


v 


0, = arc sin [ ie - +0| 


(sin B — sin y) cos wr 


6, = arc sin 


(i+rPtan?y+i (@—rP tan?y +1 


Vid +r) tan? f + 1JV[0 — vr)? tan? yp + 1) 


TR RIL 


2 COS WT 


+P tangy +1’ O—rP tant gy +i * Vd tran yg + 1d — PP any + I] 
The fractional distance of the apparent source from the central position is then given by 


_ dtan6, _ 


-- x 


_ tanO, _ 
~ tangy 


Similarly, the fractional distance of the real source from the central position before the microphones is given by 


tan 6, 
tan 45° 


‘ 


45° being half the working angle of the microphones. 


= fan 6, 


DISCUSSION BEFORE THE RADIO AND TELECOMMUNICATION SECTION, 20TH FEBRUARY, 1957 


Mr. J. Moir: I am professionally interested in the repro- 
duction of sound films in which three-channel stereophonic 
reproducer systems are common, but I have an interest in 
domestic stereophonic systems as an enthusiastic amateur. 

I have been playing these stereosonic magnetic-tape recordings 
for some months, and my only comment on their performance 
is that the authors have understated the advantages. The spatial 
distribution and the impression of size that a good stereophonic 
system gives to an orchestra is most effective and most important. 
The individual instruments and sections of the orchestra are 
clearly separated, which gives a degree of clarity and definition 
to the performance that cannot be obtained from a monaural 
system, however much the amplitude distortions are reduced. 
A monaural system with 0-5% distortion does not sound so 
‘clean’ as a stereophonic system with 5% distortion. 

It has been appreciated for the last 25 years that the public 
will not accept a reproduction of an orchestra with the full 
original frequency range or volume range. Many reasons have 
been advanced, all with some degree of truth, but there is no 
doubt that the use of monaural reproducer systems is a major 
factor in this preference for range restriction. 

It has often been said that the advantages of a stereophonic 
system are too subtle for the ordinary members of the public to 
appreciate, but my experience tends to the opposite point of 
view. Fifteen or twenty of my friends have heard these stereo- 
phonic tapes run on my domestic equipment and have had the 
opportunity of making an immediate comparison with a long- 
playing recording of the same work run monaurally. After 
about ten seconds of monaural reproduction the invariable 
comment is on the flatness. 

Though I am fairly enthusiastic about these stereophonic 


recordings I am not so enthusiastic about the authors’ explana- 
tion of how the system works. They suggest that their two- 
channel system gives stereophonic results because it allows the 
brain to compare the amplitude of the signals at the two ears, 
the signal at-each ear being the vector sum of signals reaching 
the ear from both loudspeakers. They support this suggestion 
with an elegant mathematical analysis in which I can find no 
questionable step, but in spite of this, I do not believe that the 
stereophonic results can be explained in this way or so simply. 

A source emitting a single sound pulse in the studio will result 
in two pulses reaching each ear, but I would suggest that only 
the first pulse at each ear reaches the brain. The nervous system 
transmits amplitude indications as a pulse code, pulse-repetition 
rate being related to sound intensity. However, the maximum 
pulse rate cannot exceed 1 000 pulses/sec, while 300-500 pulses/sec 
is the more usual maximum. This pulse-rate limitation is 
imposed by the maximum rate at which the transmitting-end cells 
can be chemically recharged by the blood stream. Thus, after a 
discharge, the nerve cannot transmit a second signal for a time 
which is always greater than | millisec. However, the head 
dimensions are such that the second acoustic pulse will appear 
from the remote loudspeaker with a time interval which is always 
less than 0-6millisec. At first sight, it would appear that no 
stereophonic effects could be produced if the sound source emitted 
short pulses, though in fact the stereophonic effect is most 
marked. 

Our own work indicates that there are clues to source position 
distributed over the whole of the audio-frequency range though 
the majority of the information is contained in the frequency 
range above 500c/s. 

Dr. E. C. Cherry: During the course of a recent informal 
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meeting* I expressed the opinion that stereophony, using two- 
loudspeaker systems, is impossible. In view of the authors’ 
demonstration, it seems behoven upon me to explain myself 
further. 

I repeat that it is impossible, but the practice of engineering 
continually involves the solution of impossible tasks; it is a 
regular search for compromise. In the present context, the 
theoretical impossibility arises from several reasons, not the 
least important being that the acoustic environments in the 
recording studio and in the home or concert hall are simply not 
reciprocal. Why, it may be pertinent to ask, did the demon- 
stration take place in the Great Hall of Northampton Polytechnic 
and not, more conventionally, in the Lecture Theatre? Again, 
the acoustic conditions may be adjusted to some extent, but they 
cannot be suitable for all listening situations; neither can the 
listener be free to move far from a centre-line or other given 
contour without the apparent directive qualities vanishing. 

The binaural directive location of a sound, in real-life environ- 
ments, is not yet understood; more fundamentally, the psycho- 
logical ‘projection’ of sounds to lie ‘outside’ our heads is not 
understood, nor are the physical controlling factors. Binaural 
phase (time) difference and amplitude difference, by themselves, 
do not account for such projection and angular discrimination. 

The brain makes great use of the differences between the 
signals reaching the two ear portals, and there are several 
differences other than time and amplitude (e.g. effects of room 
acoustics and the listener’s experience of typical environment 
effects; head-turning effects; head diffraction; the varied angle 
of arrival of wavefronts, and impedance mismatch at the ear 
portals). I agree with Mr. Moir that Fourier-type analysis, and 
theory such as that given in the paper, are best laid aside when 
trying to understand stereophonic hearing. Statistical analysis 
rather than Fourier analysis may be the brain’s function. 


My remarks do not purport to be disparaging in the least, but 
I wish merely to stress the need for much more psycho-physical 
study, not only of binaural hearing, but of other sensory functions 


in communication. Perhaps then we may understand results 
such as those which were so well demonstrated by the authors. 

Mr. G. Millington: In the early days of electrical recording I 
did some work under Mr. P. W. Willans on high-fidelity recording 
of the piano. We found that, with a single microphone in a 
fixed position, it was extremely difficult to avoid an uneven 
response in which some notes had a dully wooden sound while 
others were unnaturally brilliant. When one note was cured by 
moving the position of the microphone, it was usually at the 
expense of another in a different part of the register. 

I should like to ask the authors whether the placing of the 
microphone system is similarly critical when using two micro- 
phones at the same position but orientated in different directions. 
As far as I could tell, the recording given was perfect in this 
respect, and was as fine an example of piano recording as I 
have heard. 

Mr. A. M. Thornton: I assume that the system is intended to 
have a great future in high-quality and more realistic recording, 
and that ultimately we shall have this sort of stereophonic quality 
in our homes. 

In the case of the single instrument—the piano—the quality 
and fidelity were the most satisfying that I have experienced, but 
when it came to the orchestral item ‘Peter and the Wolf’ I was 
very disconcerted. The notes alternated from one side of the 
stage to the other exactly like the sound of the ping-pong ball 
in the table-tennis. game previously demonstrated. I realized 
then that that is how the conductor must hear the music, i.e. quite 
differently from the vast majority of the audience who are sitting 
some distance away. This raises the interesting point: Does the 

* ‘The Psychology of Communication’, Journal I.E.E., 1957, 3, (New Series), p. 209. 
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conductor endeavour to produce a good effect at, say, the middle 
of the hall, or is he the only person really hearing the music as 
it should be heard? 

In using the system for producing recordings in the home, 
should the listener be virtually transferred from a position in the 
auditorium to the conductor’s rostrum, or should the aim be 
to ‘seat’ him effectively in a position, say, centrally five rows 
from the front? 

Major W. V. G. Fuge: In the demonstration with two people 
speaking at the same time and recorded stereophonically, it 
seemed possible to listen at will to either conversation by mentally 
concentrating without moving the head, so that a mental process 
could switch the attention from one voice to the other. 

Is that the advantage of the stereophonic recording, i.e. that 
different people listening to the same music can concentrate on 
any particular instrument they wish from time to time and 
switch their attention to what pleases them? 

Mr. F. Oakes: Mr. Thornton asked whether the stereosonic 
recording should reproduce what the conductor hears from the 
rostrum. The answer is in the negative, because the balance of 
instruments is adjusted during rehearsal to provide the correct 
sound in the auditorium. Depending on the acoustic properties 
of the concert hall, this may well mean that what the conductor 
hears is far from acceptable, whilst correct placement of the 
microphones further from the orchestra will produce a satis- 
factory balance, such as would be enjoyed from a good seat in 
the auditorium. 

Mr. J. K. Webb: To anyone who has been privileged to hear 
a demonstration of the authors’ stereosonic reproducing system 
in an average-size living room, ‘revelation’ is about the only 
word adequately to describe it. This scheme is surely a mutation 
which is bound to set a completely new standard and inevitably 
expose the deficiencies of ‘monosonic’ recording. 

It must always be borne in mind, when considering domestic 
sound reproduction, that one cannot contemplate the enormous 
range of sound levels with which the authors have battered our 
eardrums in their demonstration, however commendable this 
may be from the technical angle. Let it be said in praise of 
music in the home, as it was of Cordelia, ‘Her voice was ever 
soft, gentle and low—an excellent thing in woman’. To circum- 
vent the Fletcher-Munson effect and otherwise wrest the maxi- 
mum satisfaction from a limited range of loudness is a problem 
which perhaps merits more attention than it has so far been given. 

Mr. R. Vermeulen (Netherlands: communicated): There can 
be no doubt about the benefit, nay the necessity, of stereophony, 
if one really wants a life-like reproduction and not just ‘hi-fi’. 
By its very nature, even the most perfect loudspeaker cannot 
possibly be any better than a hole of the same dimensions in the 
wall of the concert hall. It is still something of a miracle that 
this hole can be enlarged into a wide window frame by means 
of only two loudspeakers, but we have to accept this as a 
fact. 

Such a window seat would not be considered quite satisfactory 
in the concert hall. What we really want is a seat right in the 
middle of the hall. Therefore, though I agree that at present 
and for most practical purposes ‘the reproduction can be limited 
to sounds arriving from directions covering an angle of, say, 
90° in front of the observer’, I do not consider this to be the final 
solution of perfect music reproduction, but expect a still further 
development. 

It is true that we can imitate the orchestra itself by stereophonic 
methods, but this imitation orchestra, like the real one, will only 
sound well in a hall with good acoustics and not in a room with 
poor acoustics or in the home. In order to have a completely 
life-like reproduction it is necessary also to simulate the sound 
waves reflected from the ceiling and from the walls of the concert 


THE ‘STEREOSONIC’ RECORDING AND REPRODUCING SYSTEM 


hall. These are not limited to an angle of 90°, but they come 
from all sides, and it is essential that they do so. The reverberant 
sound must not only have the right reverberation time; it must 
also have the right diffuseness. 

It is possible to simulate such a diffuse reverberation artificially 
by placing loudspeakers all round the auditorium and feeding 
them with suitably delayed music, using several different time 
delays. By adding such a stereo-reverberation to the stereo- 
phonic reproduction, a completely life-like reproduction can be 
obtained. 

I am impressed by the elegant and convincing way in which 
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the authors have reduced the intensity differences of the loud- 
speakers to phase differences at the ear of the listener, but I still 
have some doubt whether binaural hearing can be explained by 
only one single principle. Phase differences with pure tones are 
not equivalent to differences in time of arrival of clicks, and I 
cannot see, at present, how the precedence effect, for instance, 
can be fitted to the authors’ theory. Moreover, there is some 
experimental evidence that differences of intensity at the ear of 
the listener cannot altogether be neglected. I am not yet com- 
pletely convinced that it will be wise to base our techniques on 
a single effect only. 


THE AUTHORS’ REPLY TO THE ABOVE DISCUSSION 


Mr. H. A. M. Clark, Dr. G. F. Dutton, and Mr. P. B. 
Vanderlyn (in reply): As Mr. Moir states, it is agreed that direc- 
tional clues of various kinds occur at all audible frequencies, 
and their relative importance must vary to some extent with the 
spectral content of the sounds with which a system is concerned. 
In the present case, where the reproduction of music is the chief 
aim, it is contended that low frequencies play at least as important 
a part as high frequencies. In the design of a stereophonic 
system, due regard must be paid to the types of clue capable of 
reproduction within the physical and economic framework, and 
if any are found to be mutually exclusive, a choice must be made 
as to which to reject. In the case of the stereosonic system, it 
was decided to exclude the ‘precedence’ clue, since to reproduce 
it would necessitate specification of the precise listening posi- 
tion to be adopted, from which the observer would not be 
allowed to move. For this reason, the system will not deal 


correctly with the case cited by Mr. Moir, i.e. a single short 
sound pulse. It is fair to say, however, that naturally occurring 


single sound pulses are infrequent, and analysis of most sounds 
of short duration reveals them to be composed of many cycles 
of pressure variation. For localization, the system, as Mr. Moir 
states, allows a comparison of aural amplitudes, but this occurs 
at high frequencies only; at low frequencies the mechanism of 
vector summation described in the paper permits a phase com- 
parison. 

Many workers in this field of endeavour will at times be 
inclined to agree with Dr. Cherry’s statement that two-channel 
stereophony with loudspeakers is impossible. Much depends on 
the precise meaning attached to the terms ‘stereophony’ and 
‘impossible’. He does not deny, however, that reproduced 
sounds having a pronounced spatial character have been demon- 
strated, at least to those listeners in the more favoured seats. 
Perhaps these qualifications may help to illuminate his state- 
ment. 

In the engineering basis of the system, nothing is assumed 
about the way in which the human ‘black box’ operates, nor is 
any such assumption necessary, since all that is attempted is a 
reconstruction of the acoustic conditions that would obtain at an 
observer’s ears were he actually present in the recording studio. 
One requirement for satisfactory reproduction is that the listening 
room should not add appreciably to the total reverberation, and 
it should be stressed that the system is designed for domestic 
use where this is normally the case and where the listener can 
occupy the optimum position. It is not intended for large-scale 
demonstration. 

Before finally rejecting any explanation, based on Fourier 
analysis, of the effects produced, it must be remembered that 


the first operation performed on a sound when it reaches the 
human cochlea is a Fourier analysis. 

In reply to Mr. Millington, the use of a double microphone 
driving two separate loudspeaker channels does largely avoid the 
unevenness in tonal quality in some parts of the register caused 
by standing waves in the studio. 

Mr. Thornton dislikes the alternation of sounds between one 
side of the stage and the other. Although this is certainly what 
the conductor hears there was no intention to give a reproduced 
performance from this viewpoint. The arrangement of the 
demonstration was intended to convey the impression of an 
orchestra occupying the full area of the stage. It is contended 
that such an impression would be realistic, but if a domestic 
user feels that the reproduction is on too large a scale dimen- 
sionally, it is always open to him to reduce the distance between 
his loudspeakers. 

Major Fuge is correct in supposing that one of the advantages 
claimed for the system is an increased freedom for selective 
listening to a particular part of the recorded sound. 

Mr. Webb raises the difficult question of domestic sound levels. 
It may well be that a sound can be made to appear louder than 
it really is by controlled juggling with the frequency response of 
the reproducing channel. The operating range of such a control 
is likely to be limited, however, and many will think that nothing 
can recreate the sensation of a really loud sound but reproduction 
at a level comparable with the original. In this respect he is, to 
some extent, at the mercy of the recording engineer, since, if 
modern recordings are not permitted to reach reasonably high 
levels during reproduction, the quiet passages will be lost in 
background noise. 

Mr. Vermeulen mentions the further application of stereophony 
to include reverberation from the back of a hall. Many of us 
who have not had the good fortune to hear the results of his 
experiments have, nevertheless, read about them with interest, 
and it may well be that where such means as he describes are 
applicable and economically feasible, they will add considerably 
to tne subjective enjoyment of a reproduced performance. 

Finally, in reply to him and to other speakers, it is not our aim 
to produce a complete theory of binaural hearing, least of all 
to ascribe it to a single mechanism. The stereosonic system 
came about as the result of considering what naturally occurring 
clues exist, and of attempting to recreate a few of them. Others, 
such as the ‘precedence’ clue, have had to be rejected. If the 
explanations offered to fit the observed facts are not looked 
upon with favour, it can only be remarked that they are not 
put forward in any spirit of pontification, but that, at least, they 
have the merit of some experimental foundation. 
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Stereophonic Sound With Two Tracks, Three Channels 
By Means of a Phantom Circuit (2PH3)* 


Paut W. Kurpscu 
Klipsch and Associates, Hope, Arkansas 


Recent expcriments by the author have resulted in a workable three-channel reproducing 
system in which the center channel is recovered from the conventional pair of input channels 


by combination. 


The output of each flanking channel is attenuated 3 db to preserve balance. 


Practical circuit details are given, and subjective listening results described. 


HISTORY 


s igger BASIC philosophy of three-channel stereophonic 
sound derived from two sources is: 
If two microphones are properly spaced relative to 
each other, and to the plane of the sound source, 
their combined output is that of a single microphone 
midway between them. The output of this phantom 
microphone can be reproduced by combination. 

The first published information on stereophonic sound 
using 3 channels appeared (as far as this writer is aware) 
in the form of a symposium!’ in 1934. 

The part of this symposium entitled “Physical Factors,” 
by J. C. Steinberg and W. B. Snow is appropriate to the 
present thesis. They showed 5 different combinations of 
microphones, channels and speakers, including one using 2 
microphones and 3 speakers, in which the 2 microphone 
outputs were mixed at —6 db to feed the center speakers. 

Mr. Harlan Thompson” has set 2 speakers in the center 
of a wall with 2 flanking corner speakers. One of the 2 
center speakers and one corner speaker were fed from one 
tape channel and the other center and corner speaker from 
the other tape channel. He reports obtaining a balance of 
sound across the room, retaining the stereophonic effect of 
2 channels, but eliminating the bifurcated effect one some- 
times gets with 2 channels alone. 

The writer’s experiments have resulted in a workable 
system with the center channel a half-and-half mixture of 
the 2 sound tracks. The flanking channels using corner 


* Received July 3, 1957. Revised manuscript received March 18, 
1958. Delivered before the Ninth Annual Convention of the Audio 
Engineering Society, New York, October 11, 1957. 

1 Bell Telephone Laboratories staff, “Symposium on Auditory Per- 
spective,” Trans. AIEE, 53, (1934), Elect. Eng., 53, pp. 9-32, 214- 
219, Jan. 1934. 

2 Personal communication. 


Mr. Thompson was then with Berlant- 
‘Concertone. 


speakers are fed from the 2 sound tracks with 3-db attenu- 
ation relative to the center channel. 


RECENT EXPERIMENTS 


Several unsuccessful experiments were tried which cul- 
minated in using 2 corner-type speakers in the room corners 
and a center speaker, all having substantially the same mid- 


range and upper range efficiencies and each driven by its 


appropriate amplifier with separate volume controls. The 
center channel was a half-and-half mixture of the outside 
channels; the mixing circuit maintained a crosstalk factor 
of more than 20 db between the outside speaker channels. 

When success in balance finally was achieved, a surprise 
occurred; the center channel was perfectly real and not just 
a simulated effect to fill up a hole in space. Sounds re- 
membered as arising in the center of the stage occurred 
there; one ceased to hear sounds from the 3 speakers but 
actually sensed a spread across the curtain of sound. 

In explanation, the phantom talking circuit of telephone 
practice may be used as an analogy, wherein 2 physical 
pairs of conductors provide 2 talking circuits, and the pairs 
used as conductors provide a 3rd or “phantom” talking 
circuit. 


THREE-TRACK STEREO 


One immediately jumps to the conclusion that a 3 sound- 
track stereo system should provide at least 2, possibly even 
3 phantoms. If one is going to the expense of 3 sound 
tracks the only way to take full advantage thereof would 
be to plan the recording and playback geometry for as many 
phantoms as practicable. 

Figure 1 shows the 2-track, 3-channel system with one 
phantom. Like the telephone, there can be n - 1 phantoms 
for m physical tracks. 

Figure 2 shows the customary 3-track stereo with two 
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Fic. 1. Two-track 3-channel stereophonic, showing phantom 
channel derived from the 2 tracks. 


(n-1= 2) phantoms added. Experience suggests little is 
to be gained with this arrangement of channels. 

Figure 3 shows how 3 physical tracks might be employed 
to real advantage where a domain of 2 actual space dimen- 
sions could be covered. It is fascinating to conjecture how 
one might attain a true “3D” or 3-dimensional effect with 
a small finite number of tracks and channels. But the 
purpose of this paper is to discuss the single linear dimen- 
sion afforded by the “2PH3” or 2-track, 3-channel collinear 
array. 
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Fig. 2. Possible 3-track 5-channel stereo deriving 2 phantom 
channels. 


AMPLITUDE FOR CENTER CHANNEL 


Some guessing was done as to the level to be fed the 
center channel, but the guesses were all wrong. Experiment 
led to better thinking, and a theoretical basis was arrived 
at and corroborated. 

If m isoperiodic sources of equal intensity radiate in ran- 
dom phase, the intensity sum may be any value between 
zero and nm”, the amplitude sum running from zero to n, but 
the average amplitude tends toward a value n”.° This 
writer some time ago undertook to evaluate the use of multi- 
ple geophones in seismic prospecting,’ and to find the aver- 
age effective value for cases where m is small, particularly 


2 and 3. 
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Fic. 3. More useful application of 3-track stereo, suggesting 
possibility of 3 phantoms and a ‘‘curtain’’ of sound in depth. 
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3 Lord Rayleigh, “On the Resultant of a Large Number of Vibra- 
tions of the Same Pitch and Arbitrary Phase,” Phil. Mag., Vol. X, 
1880, p. 73 (Scientific Papers, Vol. 1). See also “Theory of Sound,” 
(New York: Dover Publications), Vol. 1, pp. 35-42. 

4 Paul W. Klipsch, “Some Aspects of Multiple Recording in Seismic 
Prospecting,” Geophysics, 1, No. 3, Oct. 1936. Also, U. S. Patents 
Nos. 2,232,612 and 2,232,613. Also Appendix 1. 


Fig. 4. Basie mixing cireuit for deriving phantom channel from 
2 sound tracks. 


To divorce the concept from electronic circuitry seems to 
complicate rather than simplify. Consider Fig. 4. 

Suppose each track source to be of negligibly low impe- 
dance so that the inputs to the output channel amplifiers 
are high impedance compared to R,. If track 2 contains 
no signal, the phantom receives half the voltage of track 1. 
If track 2 contains a signal and track 1 does not, again the 
phantom picks up half the signal of track 2. If tracks 1 
and 2 contain signals equal in intensity and phase, the 
phantom receives the same signal. 


22K 


47 


22 
Fie. 5. Typical mixing cireuit. Output cable capacity from the 
mixer box should not exceed 400 pf. 


But assume tracks 1 and 2 are not equal, as would be 
expected when 2 microphones are 30 feet apart. The center 
channel then does not receive a signal represented by the 
scalar sum a/2 + b/2, but its average effective signal will 
be Y 2/2 = 0.707, or 3 db down from the outer channels.° 
Thus the flanking channels should likewise be attenuated 
3 db. 

Figure 5 shows the circuit finally arrived at to achieve 
balance. 

The 82 K and 220 K pad elements provide 3-db attenua- 
tion for the outside channels. The 33 K resistors perform 
the mixing function. The source of signal was a Berlant 
30 with added pad attenuators of about 5 K output impe- 


5 See Appendix 1. 


Oe 119 : 

a 
ee i. | 
ae : 
ee . 
ee 

eS 
ms | 
ATK | 
33K 
= 
(3) @) 

a | a 
ee | 
ee a | 


Frequency, CPS 


30 60 Joke 20KCE 


° 
3 
| 


Fic. 6. Microphone response. 


dance, and the amplifiers fed by the “2PH3 box” of Fig. 5 
presented 500 K load impedances to the box. Thus the 
crosstalk from track 1 to channel 3° was approximately 
5000/66000 or slightly better than 20 db which was deemed 
to be more than adequate, and the 500 K amplifier inputs 
constituted negligible loads for the impedances within the 
box. 

The center amplifier was provided with an input gain 
control adjustable by means of a screwdriver. Much listen- 
ing has been conducted with different adjustments, and the 
fact that the best adjustment is full-on appears to confirm 
the validity of the theory involved in making the box. 


RESULTS 


At an orchestra rehearsal, the conductor did a lot of talk- 
ing. Using the customary 2-track, 2-channel stereo, there 
was seldom any definiteness about the location of events 
taking place in the middle of the stage. Adding the phan- 
tom center channel resulted in bringing the activity in the 
central area into focus. The director talked from the cen- 
ter; a conversation between him and the harpist assumed 
the proper space relation. The tuba sounded off from some- 
where between the center and right-flank speakers. Per- 
cussions could be pointed out with fair accuracy. A soloist 
was as accurately focused as would appear to be possible 
with a separate third track. It appears to afford all, or at 
least most, of the advantages of a 3-track 3-channel system 
at substantially less expense. In the present state of the 
art, 3-track stereo uses tape wider than 14”, which takes it 
out of the usual portable type of recording machine. 


MICROPHONE PLACEMENT 


Most of the tapes studied so far were recorded with the 
idea of obtaining maximum results from 2-track 2-channel 
stereo. When they were applied to the present 3-channel 
playback system, the microphone placement appeared good. 
One tape by a young recordist who was unaware of the 
necessity of wide microphone placement used only 10 feet 
spacing for a full band and small choral group. On 3- 
channel playback, the band is inadequately differentiated 
and the choral group spread too far. 

Two symphony orchestras were recorded with 22- and 26- 
foot microphone spacing. Both spacings gave good results. 
The 22-foot spacing was judged adequate, but capable of 


®Note “track”. implies sources; there are 2 sound tracks on the 
tape, thus 2 sources, and “channel” denotes each speaker location, 
three in number. 
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improvement. A jazz group spread out about 20 feet was 
recorded with microphones spaced 15 feet; another jazz 
group spread out about 14 feet was recorded with 10-foot 
spacing. These spacings gave good results on 2-channel 
playback and the improvement observed with 3-channel 
playback suggested that the mike placement was nearly 
optimum. 

On a 2-piano duo with a microphone over each piano, 
the 3-channel playback added little, but neither did it de- 
tract. This seems further to confirm the feasibility of 3- 
channel playback from only 2 sound tracks. 

“Longitudinal Stereophonic’’ wherein one microphone is 
placed close to and the other remote from a solo source 
such as single piano, or organ with single pipe loft, has 
been practiced in the 2-track 2-channel technique, and the 
addition of the center 3rd channel appears neither to help 
nor hurt. 

Some “Chronolateral Stereophonic’* recordings involving 
recording one channel at one time and the other channel 
later with the same or a different performer were likewise 
little benefited though not hurt by the additional channel. 


SPEAKER PLACEMENT 


When this writer- started experimenting with 2-channel 
stereo, experience at various demonstrations was unsatis- 
factory; insufficient separation or sharply focused point 
source effects marred the demonstration. Accordingly the 
approach was made using wide microphone separation and 
corner placement of speakers. It was found that, usually, 
placement of corner speakers against the long wall of a 
room was better than against a short wall. Thus in a living 
room 20 by 30 feet, the performance was better with the 
speakers 30 feet apart than 20 feet. But the wider the 
separation the more apt there is to be a “hole” in the middle, 
and this has been the subject of occasional adverse criti- 
cism. 

The phantom center channel has not merely filled this 
hole; it has satisfied the ear to the extent that there appears 
to be a solid curtain of sound rather than a group of point 
sources. Probably part of the overall effect has been due 
to fortunate microphone placement. It is this writer’s per- 
sonal opinion that the corner placement of the flanking 
speakers is a contributing factor to the genuineness of the 
stereo effect as well as to the individual speaker performance 
per unit occupied space. 

As practiced, the right side speaker is a Klipschorn.® the 
left is a Shorthorn® and the center a Rebel V,° all being 3- 
way systems using the same type midrange and tweeter 
driver units. The bass efficiencies differ slightly and the 
bass cutoffs differ considerably, but the ear is not sensitive 


wae W. Klipsch, “Experiences in Stereophony,” Audio, July 
8 Paul W. Klipsch, “Making Stereophonic Tapes,” High Fidelity, 
November-December 1955. 
° Trade Mark, Klipsch and Associates. 
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STEREOPHONIC SOUND WITH TWO TRACKS, THREE CHANNELS BY MEANS OF A PHANTOM CIRCUIT (2PH3) 


to direction for frequencies below about 100 cycles, so the 
differing bass response is not sensed. As all three speakers 
are corner-type and the center unit has no corner, it is 
proposed to design a mon-corner speaker’® for specifically 
this application. 

The three speakers need not all be of the most expensive 
type, but they should afford a full treble range and a smooth 
response within the range they cover. At least one of them 
should afford a full bass range. Since extended bass re- 
sponse is expensive, it is seen that the cost of two of the 
speakers may be reduced drastically without impairment 
of the sound quality even to a small degree. 

To give an idea of the characteristics of these three 
speakers, the large flanking unit on the right is 10 db down 
at approximately 32 cps; the smaller left flanking unit is 
down 10 db at 50 cps, and the small center unit is down 
10 db at about 90 cps. Substitution of speakers with more 
extended bass range did not improve performance, even on 
pipe organ program material, where fundamentals down to 
32.7 cps were frequent. All 3 speakers are substantially 
flat to 13 000 cps. 

All recordings have been made with the Stephens C2-OD4 
microphone with output transformer disconnected. The 
microphone response is shown in Fig. 6. The Berlant 30 is 
maintained to +0 and —2 db from 30 to 16000 cps. All 
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Fic. 7. The simplest and most effective way to accomplish 
‘*2PH3 Stereo.’’ This is self balancing, uses the least equipment, 
and is actually better than the system of Fig. 5. The triangles 
signify the power amplifiers with their output transformers, and 
the circles indicate the taps of the output transformers with the 
nominal impedances marked. It is usually necessary to employ a 
low resistance connection from ‘‘O’’ to ‘‘O’’ to prevent self oscil- 
lation or instability due to feedback loops in the power amplifiers. 


10 To be called, appropriately, “Klipsch’s Heresy.” 
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Fig. 8. Sum of 2 isoperiodie signals of equal amplitude and ar- 
bitrary phase. 


recordings were made at 15 ips. The increase in cost of 
going from single channel to 2-channel stereo is something 
like half the initial cost, assuming one already has a quali- 
fied tape machine which is convertible to stereo. The in- 
cremental cost of 2-track 3-channel or “2PH3” over 2- 
channel is in this method, only that of an amplifier and the 
smallest of the 3 speakers; the cost is almost marginal and 
the improvement almost as great as that provided by going 
from single-track to 2-track stereo. 


ALTERNATIVE CIRCUIT 


Since presenting this paper at the Convention the evolu- 
tion of this system has progressed to the stage where im- 
proved performance can be obtained with less equipment. 
The mixer circuit with its restrictions of lead capacitance, 
and one power amplifier have been eliminated. The mixer 
function is accomplished as shown in Fig. 7; the output 
transformers of the power amplifiers are tapped, and the 
middle channel receives half the voltage of the outside 
channels. Mixing and balance are automatic. 

To achieve the self-balancing features of Fig. 7, speakers 
should be “compatible.” Our own system involves speakers 
which are internally balanced by means of taps on an auto- 
transformer, so the power output per volt input differs be- 
tween the units. 

A typical setup using one Klipschorn, one Shorthorn and 
one Model H, each with the new “A” series tweeter and 
W-5 balancing network, are tabulated as follows: 


TaBLeE I. Relative outputs and transformer connections for 
typical 3-speaker set-up. 


Relative output at 
1000 eyeles for a 
given voltage input 


K —3 db 


s 
H -6 4-4 or 8-8* 
Ss —-6 16 


Output tap, 
ohms 


Speaker (Fig. 7) 


* Depending on size and shape of room. Theoretical value is 
4-4, but room acoustics, including absorption, dictate the choice 
of 4-4 or 8-8. 


Since presentation of the paper, further experience has 
evolved. A large scale demonstration was held wherein the 
dimensional integrity of a string quartet was maintained 
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with corner speakers over 49 feet apart, and soloists were 
accurately focused in the center of the stage. 

Certain recordings are made with 3 microphones, the cen- 
ter one being mixed with the outer channels to form a two- 
track recording. One such is Mercury MDS 5-4. When 
played back over the 2PH3 system, center stage events are 
brought into focus in the same way as in ordinary two-track 
recorded material. When the polarities were reversed, 
exactly the same results were obtained. This indicates that 
phase effects were not perceptible. 


APPENDIX 1. Mathematical Derivation 


In a simple form consider 2 isoperiodic signals of unit 
amplitude, but random phase. The situation is depicted 
in Fig. 8. 


The sum is: 
A =1+cos¢+jsing 
= [(1 + cos )? + sin? 6] % exp j@ 
where 6 = tan" [sin ¢/(1 + cos ¢) }. 
The amplitude is: 
|A| = (1 + 2 cos $ + 1)% 
= (2+ 20s)”. 
The average value of amplitude would be 


A = (1/m) §*(2 + 2 cos $)% dp 


= (V 2/x) §* (1 + cos ¢)” do 
= (V 2/r)2-¥ 2 [sin g/2]; 


— 4/n, 


or approximately 1.27. 


Taking half this value (since the center channel receives 
half the signal from each flank) the center channel level is 
approximately 0.63 or close to 4 db down from each flank, 
and each flank should thus be attenuated 4 db for correct 
balance. 

Practice dictates a difference of approximately 3 db in- 
stead of 4, probably due to the smaller distance from lis- 
teners to the center channel than to the flanking channels. 
Further experience in various environments may indicate a 
small amount of variable attenuation in the center channel 
to be desirable. 

The average rms amplitude value would be 


Arms = [(1/m) §* (2 + 2 cos $) dg] 
=V¥2 


and taking, as before, half, the effective center amplitude is 


2/2 times the outside channel amplitude or -3 db. 
Whether one “hears” the average or the rms value is moot, 
the 1-db difference being too small to evaluate subjectively. 
Personally, this writer presumes the rms value to be the 
applicable quantity. 
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30 feet or 
slightly less 


—_ 
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Fic. 9. Two microphones at locations A and B ‘‘hear’’ respec- 
tively over circular areas contiguous at point C. A single micro- 
phone at point C’ would ‘‘hear’’ substantially the same as the 
mixed outputs of the two microphones at A and B. 


APPENDIX 2. Optimum Microphone Spacing 


There arises a question: “How far apart can a pair of 
microphones be and still afford a true phantom response, 
that is, a response substantially like that of a single, cen- 
trally placed, microphone?” Thus far, the spacings used 
have given what are considered to be practical results but 
some theoretical basis would be desirable. Non-directional 
microphones are here assumed, although the ideas can be 
extended to cover directional microphones. 

Obviously, if two microphones are less than 14” apart, 
they should “hear” alike, for even at 13 500 cps the phase 
shift difference would be only z/2. Again, obviously, if 
the microphones are so far apart that sounds midway be- 
tween them are attenuated more than, say, 10 db relative 
to sounds originating closer to one of the microphones, the 
fields of the two microphones could be said to fail to over- 
lap. 

Very tentatively the geometry of Fig. 9 is presented as 
workable. Based on experience the limits shown have 
proved satisfactory. Applying simple arithmetic: a micro- 
phone at A or B receives half as much sound amplitude 
from a source at point C as it would if placed at C’. The 
effective sum of both microphones would be 2-” as much 
as the scalar sum, or 3 db down. This 3-db loss appears 
to be tolerable. And it is surmised this may be considerably 
less than the limit of tolerance. 

It should be pointed out that when the two microphones 
are too far apart to produce a true phantom signal for the 
center channel, there still remains the hole-filling effect of 
the simulated middle channel. 

Two examples of very wide microphone spacing should 
be mentioned: 

A parade was recorded with microphones about 15 feet 
above the sound sources and 55 feet apart. With 2-channel 
playback the motion was evident. With 2PH3 playback 
the sense of motion was improved. Yet the microphones 
were so far separated as to act almost independently on 
nearby sounds. 
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STEREOPHONIC SOUND WITH TWO TRACKS, THREE CHANNELS BY MEANS OF A PHANTOM CIRCUIT (2PH3) 


Railroad yard sounds were recorded with microphones 
about 100 feet apart and only 8 feet from the nearest track, 
a flagrant violation of Fig. 9. Yet when the action passed 
the center it was heard vividly from the 2PH3 center 
speaker. 

There may be a way to test the effect. If 2 stereo chan- 
nels are connected respectively to the horizontal and vertical 
deflection plates of a cathode ray oscillograph, the transi- 
ents of music form a pattern which has beer: described as 
being like an animated wad of steel wool. If the micro- 
phones are too close together, the pattern is flattened. One 
could take 2 microphones on one channel, spaced as for 
stereo, and a third microphone midway between them on 
a second channel, and put the 2 channels on the vertical 
and horizontal plates. If the resultant of the 2 micro- 
phones is an exact “phantom” the amplitude and phase 
would be the same as for the single center microphone, 
and the pattern would be an ellipse with a very small minor 
axis. If the resultant is not exact in both amplitude and 
phase, the pattern would be more nearly round. This 
theoretical study is being reserved for some opportune time 
in the future. For the time being the results appear to 
justify the means. 


Motion and “Scale Speed” 


If microphones are 50 feet apart and speakers 25 feet 
apart, motion (as of a parade, or a race) will be slowed 
down by a ratio of 2:1. This may or may not be desirable, 


as in the use of slow motion speeds in movies. This fact 
should be remembered though; if the motion is already 
slow, it perhaps should not be slowed further. Thus if a 
criterion of 20- or 25-foot speaker spacing (for the full ar- 
ray from extreme left to extreme right) is used, a compara- 
ble microphone spacing should be employed. Then, having 
determined the spacing, the distance from the sound plane 
should be about half the spacing or a trifle more. Accord- 
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ing to Fig. 9, the spacing should be a trifle less than twice 
the distance from the plane of the sound source. Experi- 
ence in recording sounds in motion is still too limited to 
express this more precisely, but experience so far with both 
2-channel and “2PH3” stereo indicates the above to be a 
safe ratio to use. 


Summary of Discussion 


. One may forget the “phasing” in this or any other stereo 
system. With 7,000 degrees of phase shift another 180 
degrees will make no audible difference. 

. The arrangement of Fig. 7 applies a nominal load of 12 
ohms to the amplifiers nominally rated at 16 ohms. This 
degrades the power rating some 25%, but even a 10-watt 
amplifier will have a large reserve when used with speak- 
ers which are reasonably satisfactory from the stand- 
point of efficiency. 

. The mixing of the two sides due to use of the circuit of 
Fig. 7 is much less than the mixing in the air itself in 
front of the microphones. I am assuming a low internal 
impedance, say, less than a couple of ohms. Incidentally, 
damping factor control is ill advised and may even induce 
self oscillation with or without the added middle channel. 

. The center speaker should be compatible in efficiency 
and response with the other two. In our system where 
typically a Klipschorn and Shorthorn are paired at the 
corners (both being 3-way systems) the center unit is 
also 3-way and uses the same tweeter unit and same 
midrange driver as the corner speakers. The middle 
unit uses a closed-box type of direct radiator bass, and 
the balancing network steps the treble units down to 
match. The bass unit must be excursion-limited to pre- 
vent distortion. Remember distortion in stereophonic 
is just as irritating as in monophonic, and one bad 
speaker can spoil a couple of good ones by blanketing 
them with its distortion. 


| |~ 
— 


JOURNAL OF THE AUDIO ENGINEERING SOCIETY 


APRIL 1958, VOLUME 6, NUMBER 2 


Stereo-Reverberation™ 


R. VERMEULEN 


Philips Research Laboratories, N. V. Philips’ Gloeilampenfabrieken, 
Eindhoven, Netherlands 


The quality of reproduced music depends not only on the true reproduction of the complete 
spectrum and the absence of spurious sounds (noise and distortion products), but also on the dis- 
tribution of the sound over the listening room. Stereophonic reproduction can simulate the direc- 
tions of the direct sound and thus can imitate, for instance, an orchestra in a satisfactory manner. 
This is not sufficient, as the room acoustics of the concert hall are essential. The diffuse character 
of the reverberated sound cannot be reproduced by a single loudspeaker or group of loudspeakers, 


and only in a limited way by stereophony. 


It can, however, be reconstructed, even from a single channel, by feeding several loudspeakers 
distributed over the hall, with different and aleatory delays, repeated many times at decreasing 


levels. 


INTRODUCTION 


ERE IS A fundamental difference between the trans- 

mission of speech and of music. In speech transmission 
the sole purpose is to convey “meaning.” The predominant 
quality is therefore “intelligibility,” which can be measured 
numerically by articulation tests. The purpose of the elec- 
tronic reproduction of music is the transmission of its 
“beauty.” I am aware that often a phonograph is used to 
produce merely the rhythm for dancing, no value being 
‘attached to the quality of the sounds, but this is not the 
kind of use we are interested in at the moment. As long 
as hiss, hum, non-linear distortion, etc., do not interfere with 
intelligibility they do not matter so much in a telephone 
channel, but in music we are much less concerned with 
intelligibility. When listening to a radio program, for in- 
stance, we are not prepared to tolerate atmospherics of such 
strength that we can only just distinguish that it is indeed 
the Beethoven Violin Concerto which is being performed. 
We already know the score well enough; what we demand 
from reproduction is not information about the score, but 
those elusive qualities of the performance that excite an 
esthetic response, whatever that may be. We do not ex- 
actly know what these elusive qualities are, so we are not 
in a position to measure them objectively. Therefore we 
become inclined to prefer the certain to the uncertain and 
to demand perfection in at least those qualities that we are 
capable of measuring quantitatively. Our aim then becomes 
“High Fidelity”; that is, literally, as exact a copy as possible 
of the sound-pressure variations produced by the musicians. 


* Read by Mr. John M. Hollywood at the Ninth Annual Conven- 
tion of the Audio Engineering Society, New York, October 9, 1957. 
Manuscript received October 7, 1957. 


So we extend the frequency range beyond what is musically 
relevant, we increase the power of our amplifiers, and we 
try to suppress the spurious sounds such as hum and hiss, 
sum and difference tones, modulation-noise and magnetic 
echoes below the audible instead of the disturbing level. 
But, while these measures to improve fidelity will also im- 
prove the quality of the reproduced music, they do not 
guarantee really “life-like” reproduction. By stressing the 
fidelity of the objectively measurable quantities too much, 
we incur, in my opinion, the risk of becoming absorbed in 
purely technical problems and of reducing the equipment 
to “a device for joining a signal generator to an output 
meter,” as Mr. Olney seems to have characterized this men- 
tality of the pure technician. 

Even when the acoustician puts the output meter behind 
a test-microphone at some distance in front of the loud- 
speaker, he is still in effect treating the equipment, not as 
a musical instrument, but as a transmission network. When 
we try to achieve perfection not only in electronics but also 
in acoustics and require that the sound-pressure variations 
at the ears of the listener should be exact copies of those at 
the ears of an artificial head, we ask for something that is 
impossible to realize, and, indeed, for more than is really 
needed. After all the human ear—and let us not forget the 
human mind—is very sensitive to deviations which it con- 
siders to be important, but is very lenient in other respects. 
The following examples will serve to illustrate this. 


PSYCHOACOUSTIC ASPECTS OF STEREOPHONY 


Though the instantaneous values of sound-pressure at 
different seats in a concert hall will certainly differ con- 
siderably, musical appreciation of the performance will not 
vary very much from one seat to another. The same is 
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STEREO-REVERBERATION 


Fie. 1. Two damped sinusoidal waves are produced by exciting 
two differently tuned circuits simultaneously. One wave (L) is 
conveyed to the left ear, the other (R) to the right ear. Amplitude 
and frequency are so adjusted that the sound seems to come from 
in front of the listener. The result is that the two waves have the 
same steepness at their fronts. (Drawing copied from an oscillo- 
gram.) 


true when the quality of a radio set is being judged. Al- 
though we will notice differences at different positions in 
the room—for instance there may be a distressing excess of 
high notes when listening in the beam of an over-directional 
loudspeaker—the general opinion will be much the same 
all over the room. This shows that differences in sound- 
pressure that are certainly measurable need not always be 
considered essential by the listener. On the other hand, 
when we put on a pair of head phones connected by means 
of two amplifiers to two microphones placed in an artificial 
head, we get very realistic reproduction, but the sound- 
sources seem to be located behind the back of the listener. 
The remedy is to make the artificial head move in accord- 
ance with the movements of the head of the listener. This 
is not of course a practical solution, but it serves to show 
that the listener gets more information from the sound-field 
than is contained in the sound-pressures at two fixed points. 
He cannot and does not want to keep his head still, and 
even imperceptible movements will tell him things about 
the sound pattern that enable him to reconstruct the origi- 
nal acoustic environment. 

When we realize this, the possibility of a satisfactory 
stereophonic reproduction via only two channels and two 
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Fig. 2. The start and jon finish (L) of a sustained tone are 
given by the left-hand loudspeaker, the rest (R) by the right-hand 
loudspeaker. The listener is quite convinced that all the sound 
comes from the left-hand loudspeaker which gives the transients. 


125 


loudspeakers seems little less than miraculous. That the 
mind will combine the two signals from those loudspeakers 
into a single imaginary sound-source is a wonderful example 
of the willingness of the mind to overlook differences which 
it considers inessential. We do not as yet understand the 
mechanism of direction-perception, nor how in the complex 
of sound produced by a large orchestra we manage to dis- 
tinguish and combine those components of sound-pressure 
at the two ears that proceed from a single instrument. The 
work of Mr. N. V. Franssen in our laboratory seems to 
indicate that the start of the sound plays a predominant 
part. (Fig. 1.) He found that two transient waves of dif- 
ferent frequency produced by simultaneously exciting two 
damped tuned circuits, and channelled separately to either 
ear, give the listener the sensation of a sound coming exactly 
from in front when, and only when, the steepness of the 
wave-fronts is equal. Another experiment is even more sur- 
prising when one takes part in the demonstration person- 
ally. (Fig. 2.) Of the two loudspeakers of a stereophonic 
set, one is supplied with only the first and the last 40 milli- 
seconds of a sustained tone, the other with the intermediate 
portion, which may have a duration of up to or even more 
than 5 seconds. It is quite impossible mot to believe that 
all the sound is coming as one continuous tone from the first 
loudspeaker, which in fact only produces the transients. It 
is even possible to add a second tone of another pitch in the 
inverse way, the result being that each speaker produces 
the transient of one tone and the intermediate portion of 
the other. Everybody will be quite convinced that each 
tone is coming wholly from that loudspeaker which in fact 
only gives its transients. With only one tone, it is remark- 
able that the modulation noise originating from the mag- 
netic recording of the tones does not share in the effect; 
one distinctly hears the tone coming from one, and the 
accompanying noise from the other, loudspeaker. Mr. 
Franssen hopes to publish a more full account of his find- 
ings in the near future. 


SIMULATION OF THE CONCERT HALL 


We see that the ear can be deceived, and that it is not 
necessary to duplicate precisely the sound-pressures at the 
ears of the listener to give him the desired impression. 
Good stereophonic reproduction can simulate even a large 
orchestra satisfactorily, as has been shown already on sev- 
eral occasions where music was reproduced stereophonically 
in the concert hall. For example, an experiment in the 
San Francisco Opera House was reported by Selsted and 
Snyder,' where the orchestra went through the motions of 
playing while the sound came from a stereo tape. 

Several years ago we asked some 300 people to compare 
in an A-B-X test the stereophonic reproduction of dance- 
band or string-quartet music with the music played by the 
musicians themselves who were behind a curtain. Figure 3 


1W. T. Selsted and R. H. Snyder, Proc. 2nd International Congress 
on Acoustics, Cambridge, Mass., June 22, 1956. 
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Fic. 3. The percentage (y) of 310 subjects as a function of the 
percentage correct answers (x) to the question whether the last of 
three identical short pieces of music was given via the same 
**transmission system’’ as the first or as the second. These ‘‘sys- 
tems’’ were either live musicians behind an opaque curtain or 
loudspeakers giving stereophonie reproduction of a tape-recording 
of the same musicians. The mean value (m) is 75%. 


gives the frequency-distribution (y) of the right answers 
(x). One has to bear in mind that an average of 50% 
right answers would suggest that the listeners were com- 
pletely guessing, in other words, that they could not hear 
any difference. As there were musically and acoustically 
trained listeners among them who gave 100% right an- 
swers under optimum conditions we consider the mean (m) 
of 75% a good result. 

Does this mean that stereophony is the last word in music 
reproduction and that, though there may still be some 
technical progress, all the essential elements can now be 
reproduced stereophonically? Our answer is “no.” We 
have only claimed, and we did so intentionally, that stereo- 
phony was capable of giving a good imitation of an “or- 
chestra.” But to reproduce a “concert” satisfactorily the 
stereophonic reproduction needs the concert-hall and its 
acoustics just as well as a live orchestra. We all know that 
the acoustics of the concert-hall play an important part in 
appreciation of the performance, so how can we expect the 
stereophonically faked orchestra to do well without their 
support? 


NECESSITY FOR AUGMENTED REVERBERATION 


It is true that by suitable positioning of the microphone 
we can pick up reverberant sound from the hall and repro- 
duce the prolongation of the sound. But here again, we 
are apt to make the same mistake in that we reproduce 
only one and—of course—the only measurable character- 
istic of the sound-field, viz. the reverberation time, but 


neglect its spatial distribution. The loudspeakers of a 
stereophonic set can never reproduce the sound-field in the 


concert-hall with any accuracy in the home; how indeed 
could they do so with only the data from two microphones 
at their disposal? Neither can they deliver to the listeners’ 
ears exact copies of the instantaneous sound pressure at the 
place of the microphones, if only because the listeners are 
free to move their heads and are sitting at different places. 
The loudspeakers can only produce a quite different sound 
field, which will nevertheless give an impression that re- 
sembles the original in certain respects. But not in all 
respects, because they are only capable of simulating sound 
sources in the space between them. Thus stereophony can 
only widen the “hole in the wall of the concert-hall,” to a 
“large window,” but it cannot give the listener the impres- 
sion that he is present in the auditorium. It cannot imitate 
the sound reflected from all sides by the ceiling and the 
walls. This does not matter as long as the reproduction 
takes place in the concert-hall itself where the ceiling and 
the walls are present to produce reverberated sound with 
the desired properties. In a living-room, however, the ab- 
sence of enough reverberated sound or at least its different 
character places the listener in the position of an outsider. 


Methods 


Again, the ear demands that the music shall come to it 
from all directions, but it is not very critical; it does not 
demand an exact reproduction of the sound field in its 
neighborhood, it does only ask for a sound field similar 
in certain respects. Thus it becomes possible to give a cor- 
rect impression of the acoustics of a concert hall by means 
of a merely superficial imitation of the reflected sound. 
This can be done by electro-acoustic means. (Fig. 4.) 


Fic. 4. Stereo-reverberation. Apart from the two stereophonic 
loudspeakers (L & R), several other loudspeakers are distributed 
along the walls of the room and are fed with different time-delays, 
e.g. group 1: delay 40 milliseconds; group 2: delay 78 ms; group 
3: delay 125 ms; group 4: delay 150 ms. The broken lines show 
that the sounds from loudspeakers of the same group have to travel 
different distances and will not reach the listener simultaneously. 
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STEREO-REVERBERATION 


Apart from the two stereophonic loudspeakers (R and L) 
a number of loudspeakers are distributed over the room, 
preferably along the walls and near the ceiling. These 
loudspeakers are connected in groups (1, 2, 3, 4) in a ran- 
dom way to some power amplifiers. The input signals to 
these amplifiers are delayed by different amounts. In this 
way the ears receive first the direct, stereophonic signals, 
and then the same music repeated several times with differ- 
ent time-delays from different directions and with decreas- 
ing intensities. The variety in delay-times is increased by 
the different distances to the listener of the loudspeakers 
of the same group. 


There is no need for the delays to be created in the re- 
producing process; they can be introduced during the re- 
cording. However this necessitates a separate sound-track 
for each delay-time. If, for instance, six tracks should be 
available in a large cinema, we would prefer to use three 
tracks for stereophony and the other three for reverberation, 
each with a different delay-time. 

It is necessary both that the reverberation signals should 
come from different directions, and that their time-delays 
should differ. If they all came from the same direction 
they would not produce the impression of “diffuseness.” If 
we did not use different time delays, signals from the loud- 
speakers would combine, like stereophonic sound, into only 
one phantom source. It is true that a single delay and two 
loudspeakers at different distances produce already some 
degree of spatial effect, but the smoothness of the reverber- 
ant sound is much improved by using more loudspeakers 
and a variety of delay-times. 

The available delay-times are allotted to the various 
speakers in an aleatory way, but with the following re- 
strictions: 

No loudspeakers shall have so short a delay that their 
sound reaches any listener in advance of the direct sound 
produced by the stereophonic loudspeakers, for then the 
music would appear to come from that loudspeaker. On the 
other hand the interval between the direct sound and the 
first delayed sound must not be longer than, say, 40 milli- 
seconds for any listener in the hall; otherwise he may hear a 
separate echo. The intensity-level of any of the loudspeakers 
must not be too high compared with the rest, lest it be sepa- 
rately audible, but surprisingly, with the installation well 
adjusted, it is impossible to tell whether a given loudspeaker 
is functioning, even with the ear at a distance of two feet or 
even one foot. To simulate a large auditorium or cathedral 
the time-delays should be rather long (40 milliseconds or 
more) and preferably one of the speakers with the longest 
delay should have a somewhat higher volume level than the 
preceding ones. With small intervals and smoothly decreas- 
ing levels the impression, notwithstanding a large reverbera- 
tion time, will be that of a small room, something like a 
bathroom. 

The delays may be obtained in many ways. (Fig. 5.) 
Our practice has been to re-record the two stereophonic 
signals together on a wheel covered with a magnetic layer 
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and to take them off again by means of several reproducing 
heads; each head is connected to a separate power-amplifier 
which feeds several of the loudspeakers at different posi- 
tions in the room. By feeding back the signal of the last 
head into the recording head, the number of repetitions is 
increased indefinitely though of course with exp :nentially 
decreasing level. A potentiometer (P) in the feedback 
path controls the rate of decrease and thereby the “rever- 
beration-time.” 


Stereo-Reverberation 


We have not said very much about the reverberation 
time, which is often considered as the primary characteristic 
of the acoustics of a room. The reason is that we have the 
impression that the diffuseness of the reverberated sound- 
field is more important for musical appreciation than the 
prolongation of the sound. If we were to give the same 
delayed signals over the stereophonic loudspeakers, we 
would lose the sense of spaciousness and of being sur- 
rounded by music, though the reverberation time would re- 
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Fic. 5. The two signals taken from the two tracks (r & 1) of the 
stereo-tape (ST) are fed direetly into separate loudspeakers (R & 
L) and are, moreover, combined and re-recorded together on the 
magnetic layer on the rim of the delay-wheel (D). The signal is 
picked up from the wheel (D) with different delays by a number 
(here 4) of reproducing heads, each feeding a separate group (1, 
2, 3, 4) of loudspeakers, distributed over the room, for instance as 
shown in Fig. 4. The group numbers (1, 2, 3, 4) correspond in the 
two figures. The erase-head (EH) removes the recording. The sig- 
nal from the last reproducing head is also fed into the recording- 
head (RH). The potentiometer (P) controls the feedback and 
hence the reverberation-time. All heads are at a distance of about 
1% mil from the magnetic layer, so their life is unlimited. 
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R. VERMEULEN 


Fic. 6a. View of the delay-wheel (D) complete with the neces 
sary amplifiers, controls, output meter and control loudspeaker. 


main unchanged. Moreover, the regularity caused by the 
repetition of the same sequence of time-delays then becomes 
conspicuous and is audible as a flutter-echo, which need not 
be the case with the diffuse artificially reverberated sound. 
On the other hand, when we cut out the feedback, we still 
get the pleasing effect of good acoustics, while the music 
remains clearly defined and does not become muddled, as 
can easily occur with a long reverberation-time. I do not 
think it is possible to build a hall of conventional design 
which will have a well diffused sound field, i.e., produce a 
great many strong reflections, without at the same time 
having a long reverberation-time. So we have become ac- 
customed to associate one with the other, and to take the 
easily measurable property for the essential one. We have 
called the diffuse artificial reverberation: “stereo-reverbera- 
tion” to stress the importance of its distribution in space. 


APPLICATIONS OF STEREO-REVERBERATION 


As regards the practical application of stereo-reverbera- 
tion the need for many channels seems to rule it out for 
broadcasting and the domestic phonograph. If two and 
only two channels are available as is often the case with 
tape recorders, it is in our opinion still an open question 
whether it is best to take both channels for stereo-tracks or 
to take one for a direct and the other for a delayed track. 


The Post Office authorities in the Netherlands are experi- 
menting on the best way of using a combination of two of 
the four available circuits of their wired-radio network, in 
order to give listeners one double channel with better quality 
of reproduction. They have not yet decided what is pre- 
ferable, stereophony or stereo-reverberation. An individual 
delay-producing apparatus in each home would certainly be 
more flexible, but we do not know of a sufficiently inexpen- 
sive solution with the necessary quality and reliability. 
Commercially, therefore, stereo-reverberation has only 
been applied to theatre acoustics, either to produce special 
effects in an opera or a play, or to improve the diffuseness 
in an auditorium and thus make it suitable for orchestral 
performances. To improve the acoustics of a hall with too 
little diffuseness or too small a reverberation time, even an 
elaborate electro-acoustical installation will be less expen- 
sive than making structural alterations. But, quite apart 
from this, a theatre can be given a double set of acoustic 
properties by installing stereo-reverberation; a theatre pos- 
sessing good acoustics for speech can thereby be made suit- 
able for orchestral performances without its virtues as a 
theatre being impaired. Figures 6a and 6b show the ap- 
paratus for these purposes. The same principles apply, the 
same loudspeakers, the same delays can be used, only in- 
stead of using a recording as the input, one or more micro- 
phones are placed over the orchestra. A serious difficulty 
is the second feedback path from the loudspeakers to the 
microphone with the room as an intermediate link, which 
shows a very irregular frequency-characteristic with a few 
exceptionally high peaks. These give the circuit a tendency 
to start oscillating at one or two frequencies long before 
the average reverberation time has been prolonged appreci- 
ably. The peaks cannot be suppressed effectively by elec- 
trical means because they are very narrow and moreover 
are apt to change both in frequency and height when per- 
sons, objects or curtains are shifted. Every precaution 


Fig. 6b. Close-up view of the delay-wheel (D) with driving mo- 
tor (M), flywheel (F) and heads (H). 
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Fic. 7. View of the stage of La Scala Theatre in Milan. In the cove along the ceiling the 
loudspeakers of the stereo-reverberation installation are concealed, as shown in the diagram for 
one of them. 
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Fig. 8a. The auditorium of the Palais Chaillot in Paris, the ar- 
rows indicating the places where the loudspeakers of the stereo- 
reverberation installation are concealed in the cove; some of them 
are only just visible at the back. 


Fie. 8b. Proscenium arch of the stage at the Palais Chaillot, at 
the sides of which are loudspeaker columns for stereophonie repro- 
duction of speech (R & L) and of orchestral musie (DL), the loud- 
speakers for the stereo-reverberation (SR) being concealed all 
around in a cove. 
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must be taken in the positioning of the microphones, the 
choice of their directivity and the shaping of their fre- 
quency-characteristic, to achieve a werth-while improvement 
in acoustics without any tendency for a tone to go on 
sounding after the rest of the chord has decayed. 

But it can be done, and I consider it as evidence that 
electro-acoustics has come of age when well-known musi- 
cians are willing to accept the assistance of loudspeakers, 
not to produce greater loudness, but to improve the quality 
of their live concerts. For three years Willem van Otterloo 
has been conducting the 100-man “Residentie Orkest” in 
a hall equipped with stereo-reverberation; the illustrations 
show where loudspeakers for stereo-reverberation have been 
concealed in the famous La Scala Opera House in 
Milan (Fig. 7), and in the “Theatre Chaillot” in Paris, 
(Figs. 8a and 8b), where moreover there are loudspeaker- 
columns for stereophonic amplification of speech (R and L) 
at the sides of the proscenium and two other loudspeaker- 
columns for the amplification of the orchestra (DL) at the 
top of the proscenium. 


R. VERMEULEN 


CONCLUSION 


To conclude, I should like to give the warning never to 
exaggerate! People will perhaps say that your installation 
is no good, that it is only the orchestra which has improved. 
Still, do not be tempted to demonstrate your achievement 
by laying it on too thickly. Under no circumstances must 
the public become aware of the use of loudspeakers, for 
their reputation has become so bad by misuse that the mere 
suggestion that they are present can spoil appreciation of 
the performance, even when they are not in use. 
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British Patent Specifications 
(Continued from page 98) 


A good binaural effect may be obtained by giving direc- 
tional significance to only a limited range of frequencies. For 
example, although good reproduction requires the transmis- 
sion of all frequencies up to, say, 10,000 c.p.s. yet a good 
directional effect is obtained from frequencies up to, say, 
3,000 c.p.s. This would assist disc recording of the binaural 
impulses since the lateral cut which represents the sum of the 
two channels to the speakers might have a frequency range 
extending to 10,000 c.p.s. whereas the hill-and-dale cut need 
transmit frequencies no higher than 3,000 c.p.s. This would 
considerably simplify the design of the recorders and pick- 
ups in that low inertias would only be required for the lateral 
cut and design would thus be greatly simplified. 

These frequencies are given merely by way of example, 
are not necessarily the optimum frequencies for design of 
this character, which will be determined by other considera- 
tions. 

In transmitting the two channels indicated in the various 
systems above described, instead of employing line trans- 
mission, radio transmission may if desired be employed. 
Each channel may be separately transmitted or preferably 
the two channels may be sent as different modulations of 
the same carrier wave. Thus one channel may be transmitted 
as an amplitude modulation and the other as a phase or fre- 
quency modulation of the same carrier wave. Alternatively 
the two channels may be transmitted as amplitude modula- 
tions of different carrier waves which are 90° out of phase, 
the two waves being radiated from the same aerial in combi- 
nation as a single wave propagation. Various systems for 
the transmission and reception of duplex radio signals along 
these lines are known and any one of such or similar ar- 
rangements may be used in connection with the invention 
described herein according to its applicability or convenience 
in the circumstances under consideration. It must be under- 
stood that with such a system of duplex radiation, it is 


possible, if desired, to perform one of the summing and 
differencing processes in the radio link. For example, by 
demodulation at the receiving end with two carrier waves 
90° out of phase, which carrier waves are 45° out of phase 
with the original modulating carriers, the resultant low fre- 
quency channels are the sums and differences of the original 
low frequency channels at the transmitter. 

The hereindescribed system while being especially applic- 
able to talking pictures is not limited to such use. It may 
be employed in recording sound quite independently of any 
picture effects and in this connection (as well as when used 
in cinematograph work) it seems probable that the binaural 
effect introduced will be found to improve the acoustic 
properties of recording studios and to save any drastic acous- 
tic treatment thereof while providing much more realistic 
and satisfactory records for reproduction. Furthermore, the 
system may clearly be employed when the microphone out- 
puts are led to the loud speakers instead first of being re- 
corded, and such an arrangement may for example be em- 
ployed in public address systems in which directional sound 
effects are desired. In general the invention is applicable 
in all cases where it is desired to give directional effects to 
emitted sound. Also in all cases, both when the impulses are 
fed to the loud speakers without recording and when they are 
recorded for subsequent reproduction the total modification 
and/or interaction of the channels may be accomplished in 
more than one stage. For example, using pressure micro- 
phones, the low frequency phase differences may be aug- 
mented, the medium frequency phase differences converted 
to amplitude differences, and the high frequency amplitude 
differences augmented in a first stage of modification; the 
low frequency phase differences may then be converted to 
amplitude differences in a later stage of modification. One 
or both of these stages may occur either before or after the 
sound has been recorded. In this manner the very small low 
frequency phase differences are augmented before they are 
amplified, so avoiding troubles due to small low frequency 
phase shifts in amplifiers. 
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INTRODUCTION 


Ross H. Snyper: Welcome to the Symposium on Stereo 
Tape Standards. We have tonight a discussion we hope 
may be of importance, in the long range work of the So- 
ciety. We probably will create no sensations, but if we 
achieve some agreements here among us and with the gen- 
tlemen in our audience, we may create better markets for 
the equipment and the tapes and discs which we are inter- 
ested in making, and we may reduce the costs in doing so. 

Each of us had a somewhat different position on the sub- 
ject of what standards, if any, should be adopted for com- 
mercially released stereotapes. We expect in our discussion 


to have some agreements and some disagreements. In an 
effort to establish the ground to cover, if not necessarily to 
agree upon, I have a prepared statement in which I, as the 
representative of an equipment manufacturer rather than 
the manufacturer of the finished stereotape, suggest that it 
is appropriate that we establish some standards, as an 
engineering society, and recommend them for adoption by 
other societies, for example, the Magnetic Recording Indus- 
try Association (MRIA), The Record Industry Association 
of America (RIAA), and perhaps, eventually, the American 
Standards Association (ASA). 


PREPARED STATEMENT BY THE CHAIRMAN 


Standards for stereotape may be established either by 
agreement after open discussion or by application of the 
most widely prevailing expedient. The history of stand- 
ardization in the disc industry suggests that widespread 
agreement, for example, on equalization, may be reached 
only when the state of technology is sufficiently advanced 
that the extremes are clearly identified arid the area of de- 
sirable performance somewhat narrow. In the disc indus- 
try, the cost of waiting this long was very high, and the 
resulting standard was in many respects near optimum, both 
technically and economically. It may be that commercial 
tape will be even more fortunate. 


A STANDARD FOR EQUALIZATION 


It appears that in the matter of tape equalization, it 
would be technically desirable and economically feasible to 
adopt now the standard which has the widest usage in the 
professional tape recording industry. Precise definition of 
any tape equalization standard may perhaps best be given 
in terms of the reproducing curve which will give back flat 
response when applied to the playback of a magnetic head 


which has negligible gap and electrical losses. It is not 
easy in professional working situations to apply this defi- 
nition, however, because it is not easy to ascertain if a given 
playback head does indeed have negligible losses. Efforts 
to relate the equalization standard directly to the absolute 
value of surface induction on tape have floundered on the 
colossal problem of measuring surface induction accurately 
in the working situation. 

The most promising practical arrangement appears to me 
to be to designate as standard a number of standard tapes 
very carefully checked by participating laboratories for 
equality of induction from tape to tape. If we establish in 
the laboratory that these play back flat through a given 
high quality head, through an amplifier of given curve, 
then we may seat the standard in the tapes rather than in 
an equation. Duplicates of such tapes can, we believe, be 
made inexpensively to an accuracy of + ™% db or perhaps 
even + % db.! 


1J. B. Hull, “A Simplified Method for the Performance Measure- 
ment of Magnetic Tape Recorders,” Proc. 1.R.E.-P.G.A., March 1956. 
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As for the curve selected, a 7% ips standard currently 
in use in more than 90% of the professional studio magnetic 
recorders in this country might be examined profitably. 
This curve, the same as that already adopted as standard 
for 15 ips by NARTB, and adopted at present as a pro- 
posed standard for 7% ips by NARTB, was originally 
selected with two considerations in mind. Stories have 
been told, some of them hypothetical, some of them true, 
about the way in which the standard was first assigned. 
This is the situation as it really was: First, it was con- 
sidered economically desirable that the reproduce charac- 
teristic be simple in form and be applicable, if possible, to 
at least two operating speeds, since the likelihood appeared 
great that tape reproducers, especially in the home, would 
one day greatly outnumber tape recorders; and second, 
technically, a reproduce curve should assist in establishing 
room for recording all of those sounds which are character- 
istic of audio entertainment, while economically permitting 
the suppression of those noises which are characteristic of 
the medium and which are most likely to be objectionable. 
The possibility of using the same reproduce curve at 7% 
as at 15 was attractive. An investigation was made of the 
recording area which it provides on the chart of frequency 
vs energy. This has been reported elsewhere.” Briefly, it 
reported that the limits imposed by the tape itself permit 
with this curve, an overall signal-to-noise ratio of somewhat 
more than 60 db over a range of 30 to 15,000 cps. Contro- 
versy, so far as it existed concerning that standard, was 
over the limitation imposed upon high-frequency, high- 
energy transient peaks. The high frequency dynamic re- 
serve of tape recorded to this curve ranges downward, from 
9 up to 15 kc, between limits of 40 and 32 db. This is a 
little less than the reserve in the outermost grooves of an 
RIAA-equalized 12” hot-stylus “Microgroove” lacquer disc. 
It is about equal to the outermost grooves of the 10” lac- 
quer, and it is substantially superior to the grooves farther 
in. Since the 7% ips speed is intended mainly for release 
rather than for mastering tapes, these parameters seemed 
to us highly appropriate, particularly since, with tape’s 
constant velocity, its reserve is continuously available, and 
since, with tape’s immunity from abrasion effects, neither 
noise nor distortion increase with playing. 

The alternative, if we are not to increase tape speed, 
would be to reduce the degree of pre-emphasis and post- 
de-emphasis, and proportionally to raise the noise level. 
This noise increase would entirely be in hiss, which is pres- 
ent even with the best of recording methods, and which is 
the most troublesome of noise problems—usually too close 
to audibility for the comfort of most recording engineers. 
Indeed, most serious consideration was given, when that 
tentative standard was built into a number of machines, to 
making the pre-emphasis considerably steeper than it is. 
Little or no consideration was given, after these calcula- 


2 James L. Havstad and R. H. Snyder, “Equalization Considera- 
tion in Audio Magnetic Recorders,” Proc. 1.R.E.-P.G.A., March 1956. 
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tions were completed, to the possibility of reducing the 
amount of pre-emphasis, simply for the reason that I have 
given, that hiss is already the problem that bothered us 
most. 


A STANDARD LEVEL 


If we should adopt a designated tape for our standard, 
it might be well to include on it a standard level signal. 
Such a standard, again, has existed for some years and is 
in wide use in professional recording studios. It was first 
established as that level of 1000-cycle tone magnetization 
of the tape at which total (RMS) harmonic distortion 
reaches 1% due to the approach of tape saturation. As 
tape improved over the last several years the distortion at 
this absolute level decreased. With present tapes it now 
stands at 0.95%. There is, of course, no reason why this 
should be regarded as “zero” on the standard VU indicator 
for release tapes, since special circumstances might require 
lower distortion with less signal-to-noise ratio, or might 
permit a higher distortion level at peaks in favor of lower 
relative noise. But it would serve as a reference to which 
individual tapes or individual releases could be referred. 


A STANDARD FOR TRACK PLACEMENT 


The second area in which standardization appears to be 
both desirable and feasible now is that of track placement. 
The stereophonic effect is highly sensitive to mutilation if 
tracks originally recorded simultaneously are not repro- 
duced in synchronism. At 7% ips audible deterioration 
sets in when the sync error exceeds about 0.001” longi- 
tudinally on the tape. 

The staggered stereotape was devised as an economy 
expedient since it permitted the use of two inexpensive 
readily available half-track heads on the reproducer. Ex- 
perience has shown rather quickly that the cost of main- 
taining inter-gap separation, from machine after machine, 
within one mil of the usual stagger, was excessive. After 
a short period of time the entire industry has made avail- 
able in-line stereotapes and in-line machines. Now that 
there is no economic advantage in the staggered system, the 
technical advantages of the in-line arrangement may well 
persuade us to abandon stagger once and for all so that the 
consumer may be relieved of confusion and the stereotape 
market expand unhampered. 


A STANDARD FOR SPEAKER PLACEMENT? 


A much more controversial subject is the possibility of a 


standard for speaker placement. The consumer of high 
quality stereo recordings quite justifiably generally places 
his loudspeakers wherever they sound best and look best 
to him. Yet perhaps no question is more commonly asked 
of us than “Where should I place the loudspeakers?” We 
might well want to establish a standard position which we 
might adopt in recording if only so that we may predict 
what effect the customer will hear. Clearly, nothing so 
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STEREO TAPE STANDARDS 


simple as “6 feet from center to center of cone” can possibly 
serve, since rooms differ so much in size and acoustics, and 
since the distance from speaker to listener plays so impor- 
tant a role; in other words, because the information with 
which we are dealing is angular rather than spatial. The 
artistic variations which stereo affords should surely not 
be shackled by any rigid standard. Yet, if the producer 
is to have any idea of what his listeners eventually will 
hear, perhaps a standard way of monitoring during record- 
ing would be helpful and should be tried. Apparently, 
angular separation might be a reasonable hook to hang to. 
It is suggested that every monitor situation, however large 
or small, and of whatever acoustics, could be fitted with 
speakers equidistant from the director. Direction can be 
subjectively determined, not with measurements, but within 
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surprisingly narrow limits by ear, if tones in the 2000-cycle 
region are fed first to one and then to the other speaker. 
If a standard angle were selected, the first step would be 
taken toward giving director and consumer at least a 
roughly known position, even if this were only to be used 
as a point of departure. This might be 45°, 60°, or 90°, 
for easy measurement. To be sure, only the enthusiasts 
and the consumers would be quick to learn about this stand- 
ard and to know how to apply it. But the whole high 
fidelity industry is based on the enthusiast. What he does 
with the standard stereo speaker position today may be 
incorporated into machines for the millions in the future. 

With this, then, as something to agree with, or differ 
with, or modify, or change, I will ask that Mr. Miltenburg 
make some comments. 


PANEL DISCUSSION 


EQUALIZATION PROBLEMS 


Mr. MILTeNBuRG: Ross, you know that I am probably 
one of the greatest advocates of standardization in the re- 
cording industry today. We are producing a lot of pre- 
recorded tapes using the 7'2-ips NARTB curve, and frankly 
we haven’t experimented, or gone into reasons to use, or not 
to use this curve. We have merely accepted it because, 


first of all there was a 7'4-ips standard tape available. It 


would be wonderful if we could assume that the standard 
playback characteristic for 7% and 15 is the best curve 
possible and the ultimate for the tape industry and for 
home use in the future. I believe there is something to be 
desired in this curve—the tape hiss situation is still our 
biggest problem. No matter what we do, we still have a 
lot of tape hiss, not only on the 7%4-ips copies for home 
use, but even on our masters. We are unable to bring the 
tape hiss down to where we think it should be. Discussions 
with the tape manufacturers give us hope, but I believe we 
need a little more than hope; it’s been a long time since 
anything has been done about it. 

SNYDER: We certainly got well started on the subject of 
equalization. More certainly should be said, and more 
certainly needs to be learned, about the causes and possible 
ways to reduce tape hiss. John Beaumont, would you make 
some comments on equalization? 

JoHN Beaumont: I do have a comment on equalization 
because it’s a sort of vested interest right now. The MRIA 
is making an approach to a recommended standard for 7%- 
ips reproduction. Inasmuch as I have the dubious pleasure 
of heading the sub-committee, it’s a current project. Our 
feeling at this point is that the 7'4-ips reproduce curve 
which is the proposed NARTB, or, as most of us usually 
call it, the Ampex curve, does present some problems, and 
a survey made by the MRIA indicates that the NARTB 
characteristic is far from universal among American ma- 
chine manufacturers. It is the accepted standard, appar- 


ently, for all professional recording equipment, but the 
home recording field, which is a large part of our considera- 
tion as an industry association, is split more or less evenly 
between the NARTB type characteristic and a character- 
istic known as the CCIR, which is the current European 
standard and differs from the other curve largely in that it 
includes less record pre-emphasis, and hence less playback 
de-emphasis. Some manufacturers go even further in this 
standard and put much less pre-emphasis and consequently 
much less de-emphasis into their machines. But lumping 
those two categories together under the same philosophy, 
more manufacturers prefer Jess pre-emphasis, if you would 
like to put it that way. 

Our consideration for proposing a standard led us to 
propose a partial compromise, which consisted of the CCIR 
high end, which slopes out in much the same way as the 
other characteristic, with a maximum of 5-db difference at 
12 kc or thereabouts. However, with regard to signal-to- 
noise difficulties, which is the main claim against this being 
adopted as an American standard, in that area where the 
ear sensitivity is greatest, the general region around 3000 
cps, the difference among curves is approximately 2% db. 
Now, since the pre-emphasis is less, it is theoretically possi- 
ble to raise the recording levels without the danger of going 
into high-frequency overload to some degree. This and 
other factors lead us to believe that the signal-to-noise 
difficulties are probably offset by other factors and we do 
not feel that this alone is sufficient to decide one over the 
other. We chose the NARTB curve below 1,000 or 700 
cycles because it offers particularly the home machine manu- 
facturer a little easier problem with his amplifiers, below 
100 cycles. The combined curve then looks like NARTB 
at the low end and CCIR at the high end. It has this 
advantage for the manufacturer: the NARTB curve ab- 
sorbs some 33 db in the equalization circuits, considering 
both high and low ends. The CCIR curve absorbs 30 db. 
The composite curve absorbs 28 db, a saving of 5 db of 
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gain in the system, something which is not trivial, particu- 
larly to the maker of the home machine who is perhaps 
working a little harder than the professional maker to make 
his equipment good with the least number of stages. There 
are perhaps many other arguments, but I wanted to spell 
out the shape of things. We don’t think it’s too radical 
a proposed change, but we do feel that other factors be- 
sides super pre-emphasis alone are responsible for some of 
our signal-to-noise problems. 

SnypER: Thank you, John, it can hardly be doubted 
that our problem is not as simple as equalization alone; it’s 
also clear already that we have a substantial area of need 
to find agreement. I would like now to ask Mr. Ed Soren- 
son, who has had many years’ experience in this same area, 
to add his comments to those which have already been made. 


Ep Sorenson: I do agree with the other two speakers on 
the various problems. Looking at it from the point of view 
of RCA or CBS-Columbia Records, our natural inclination 
is to use the same equalization for 74% and 15. It makes 
our life much easier when working with a multiplicity of 
machines. I think you can appreciate that. But when we 
try to define a standard which might possibly encompass 
30 or 40 other small-machine manufacturers (I don’t mean 
small in stature) for the consumers of our product, I feel 
that we should do what we did when we tried to start a 
standard curve in disc recording: look at it from a playback 
point of view. If tapes are to be made, it’s the potential 
market we have to look to. If there is a middle ground 


where we can all concur, I think all of us might have some- 


thing to gain. It’s rather encouraging to see that the 
manufacturer, who makes the machine that we hope to play 
all these tapes on, has a chance to bring out the picture. 
As moderator, I think it devolves on Mr. Snyder to review 
the problem that he has, having more or less established 
some standards. It might well be possible to find a middle 
ground. 

As to standards on direction or speaker location, would 
it not be possible to specify an angular placement standard, 
somewhat along the lines of Dr. Olson’s suggestion in his 
paper? We might also suggest a standard level! Most of 
you remember all the control rooms you have been in, and 
some of the sound intensities you have been exposed to, 
some up in the 95- and 100-db ranges, and you wonder how 
a person possibly can balance, whether its monaural or 
stereo or whatever. We have the same problem here, in 
balancing for the consumer. What acoustic level should 
we standardize on? I think that should be thought about, 
and tied in with this business of angle and direction, etc. 

But in all the other cases, I see no great disagreement 
at all, with the one provision, we must think of the problem 
of the people who are making the machines that are going 
to reproduce the tapes we hope to sell in great numbers. 

SNYDER: We have, then, two subjects which are in many 
ways related and in some ways separated from one another, 
—this knotty one of equalization, which I would like to 
make another comment or two on, and the subject of stand- 
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ard speaker placement and perhaps standard level. Let me 
dwell on the latter for just a moment. 

We have some studies which indicate that the average 
level at which the high quality phonograph owner plays 
his machine for incidental listening is in the 60- to 70-db 
absolute range, as read on the standard level indicator, and 
that the average level for music to which one’s full attention 
is given, runs somewhere above 75. The maximum level 
is not given in the studies we have. Do you think that 
there’s material for consideration here, Mr. Sorenson? 

Sorenson: Yes, I do. I believe most of us have been 
forced at one time or another to arrive at what might be 
an approximation of a standard listening intensity, and we 
have many times set our desired value at about 75, perhaps 
78, but never exceeding 80. Of course, you must make 
allowance for the ambient noise in the area, but somewhere 
between 75 and 80 seems to me a good logical intensity to 
use. That has been our finding. 

SNyDER: Do you differ with that, Mr. Miltenburg? 

MILTENBURG: Yes, I do. We have just finished a series 
of tests in controlled monitor listening, and with rare ex- 
ceptions find that the monitor level in the control rooms is 
around 100 db. I believe possibly that some of our engi- 
neers are a little lacking in sensitivity, but the overall con- 
sensus, and the actual operating procedure, show that our 
monitors, in every monitor room where we have our studios, 
shoot well over 100 db. 

SNYDER: We are just about on the threshold of pain! 

SORENSON: We have seen the same thing and been 
amazed at what happens. I suppose the same practice at 
RCA is repeated all around the country. 

Beaumont: I think perhaps this high level is sometimes 
habitual, that the engineer riding the board is listening for 
things which are not music, he’s listening for studio noise 
and other extraneous low-level matter. However, some of 
us get tired of that and we drop our levels, and, if you work 
for a commercial recording studio, the client sets your level, 
you don’t. 

MILTENBURG: We have fried to drop the level in our 
monitor rooms. I feel that this is a semi-annual business 
in RCA. But when we tried to drop it by 30 db, the re- 
sulting records were pretty sad. Reception seems to drop 
off very sharply below 90, and it becomes more acute as 
level goes up. I believe if we had monitor systems that 
were capable of producing 200 db we would be using them! 
But the commercial releases which come out of present-day 
manufacturing sources leave much to be desired. 

Snyper: All of us have noticed the quality improvement 
within the past years; it has been improving the whole 
period, and all of us must take great gratification in the 
quality of discs and the quality which we are now getting 
in tape. So we must be talking about borderline effects and 
borderline improvements which might be made. _There- 
fore, interesting as the psychological facts on the level pref- 
erences of recording engineers may be, we might return to 
the subject of equalization. I think it goes without saying 
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the firm which I represent has a vested interest in certain 
characteristics. I know that you will forgive me for not 
having readily at hand answers to some of the objections 
to that curve, which Mr. Beaumont has raised. We have, 
after all, devoted a considerable amount of time and study 
to the curve. 

There is an extremely easy way in which 5 or 6, or 7 or 8 
db of gain may be saved in amplifier design by tape re- 
corder manufacturers, while yet adhering to the proposed 
NARTB 7'-ips standard. One of these would be to limit 
the highest recordable frequency so that the pre-emphasis 
is carried only to 8000 or 10 000 cps, yet keeping playback 
capability still to 15000 cps. It is, of course, perfectly 
true that in order to carry the record curve and playback 
both to 15 000 cps you need a total of 33 db of equalization. 
But there is a 6-db difference between 10000 and 15 000, 
and there is better than 8-db difference between 8 000 and 
15 000 cps. 

An inexpensive magnetic tape recorder is an entirely re- 
spectable piece of equipment. Its design should be treated 
with honesty and with respect, and with respect we should 
not expect an inexpensive magnetic tape recorder to go to 
15 000 cps at 7% ips. 

We are talking about mechanical accuracies in micro- 
inches when we do this. We can reasonably expect a ma- 
chine of professional standards to do it, but there is no 
reason, with increasing honesty for the product, and with 
the respect for the intelligence of its customers which is 


becoming characteristic of the high fidelity industry, that 
we should expect a $100 or $200 recorder to get 15 ke at 


7% ips. We can preserve the high degree of hiss rejection 
which the proposed NARTB characteristic offers us, without 
building into the machine this very large amount of equali- 
zation which has been mentioned. I believe it would be 
much more honest to do so than to seek a compromise in 
which hiss level is promoted, if not directly increased. I 
grant, of course, that there are other causes for hiss; in- 
efficiency of the playback head, of course, is likely to be 
one. Nearly all of these potential causes of hiss are, how- 
ever, minimized—we certainly cannot say eliminated—by 
the adoption of a curve which gives us the minimum of 
de-emphasis consistent with adequate high-frequency dy- 
namic reserve. Now, of course, it does not follow that if 
the inexpensive magnetic tape recorder is designed so as to 
record only to 8 or 10 ke on curve, that its playback cir- 
cuitry need be limited in this regard. You don’t have to 
build in gain in order to extend the roll-off all the way out to 
15 kc. So we can still make commercial stereotape releases 
which have substantial energy out of the 15-kc point, with- 
out requiring the inexpensive recorder to do so. If I have 
not made my point, I welcome questions from Mr. Beau- 
mont, either of the gentlemen on the panel, and later, from 
the floor. 

BEAUMONT: You rather spoiled my best argument in 
saying that you were not proposing to degrade the playback 
standard by lowering the pre-emphasis characteristic in the 
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NARTEB recording. It would be a shame to do that, because 
we know whether it be in a $200 or $2000 machine, the 
playback is perfectly capable of 15-kc reproduction. I re- 
member again the questionnaire that my predecessor in the 
sub-committee sent to the home-machine manufacturers. 
The majority of replies from machine manufacturers favored 
at least a 12.5-kc bandwidth. Some even then, over 2 years 
ago, were willing to say that they felt 7’ ips should go to 
15 kc. An equally small number felt that it didn’t matter 
if they went over 8 kc. But I don’t know whether this 
answers my objections in terms of the standard we are pro- 
posing, so I am going to sidestep answering you directly 
on that. However, there is a thought in limiting the record 
characteristic, though I wouldn’t say again, with today’s 
machines, that they are incapable of good performance 
above 10 kc. At least the particular home machine that I 
have, is. It does so very well with a standard available 
commercial head. I would certainly argue against any 
reduction in playback standards, in considering home ma- 
chines as separate from professional machines. 

SnyDER: We seem to have reached a substantial point 
of agreement. There is a point which I would like not to 
have escape us: that where economy is so important that 
5 or 8 db of extra gain in the electronics appear to be 
critical, it is unlikely there will be money for a high pre- 
cision head. 

BEAUMONT: Almost universally, in home machines, the 
record head is also the playback head, so we are not dealing 
with different structures, we are dealing with narrow-gap 
record. It’s a little different animal from the wide-gap 
record. The head itself is inherently capable of handling 
those bandwidths, presumably either in recording or playing. 

MILTENBURG: I wonder why we are even considering a 
record characteristic of well over 8,000 or 10,000 kc if the 
manufacturers are worried about the cost of the cheaper 
home machines, because I would like to know what you are 
going to record in the home that has 8 or 10,000 cycles on it. 

BEAUMONT: It’s a reasonable question. It seems to me 
that if we are going to manufacture tapes for use in the 
home we should, at 7% ips, put as much as we can on them, 
to give the tape user everything he could get on a disc and 
thus encourage him to use tape. If we restrict the frequency 
response we might just as well throw the whole thing in 
the ashcan. 

Snyper: I wouldn’t propose restricting the frequency 
response of the professional recording which is sold as a 
commercial release. I was only proposing to restrict the 
recording characteristic of the lower priced magnetic re- 
corder, in order that the same equalization characteristic 
might be built into his record circuit as into his playback. 
Certainly, it’s the thing to do to standardize it because as 
Mr. Miltenburg says, who is going to record at more than 
6 or 8 or 10,000 cycles in his home anyhow. 

MILTENBURG: The point being, how many home users 
have microphone equipment capable of going out beyond 
6 or 8 or 10 kc? 
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Beaumont: We must remember that if we take this 
NARTB recurd characteristic, where you cease to rise at 
8 or 10 kc, your playback equalization must match this. 
It makes it no longer compatible with the wide range stand- 
ard, without two positions. 

SNYDER: It would only mean the tapes which were re- 
corded on such a machine would roll out very rapidly about 
about the 8 or 10 kc selected as the cut-off point. 

Beaumont: Then you are proposing to keep the play- 
back standard what it is or might be at the time? 

SNYDER: Yes. 

MILTENBURG: There are arguments for this. Again, 
where would you record anything over 8 or 10000 cps in 
the home? 

SorENSON: The answer to that, Bill, is the same as the 
reason you might give if I were to say “Why do you bother 
giving your customers response to 15 kc on your tapes?” 
I know you do, and I know that Columbia does it on its 
tapes. You do so because the customer likes to know that 
it’s there, and the others who can tell whether they have 
it or not are gratified to get it. It’s a matter of pride and 
of respect for your customers, and for the eventual im- 
provements which you expect to see them make in their 
equipment. I think there is a point to be made here. 

SNYDER: But not in the recording made on the home re- 
corder? 


MILTENBURG: True, we as professional people should 


give the customer everything possible so as not to foul up 


the standardized reproduce characteristic. But as far as 
home recording goes, if it’s going to be a dollars and cents 
item to cut off with 8 or 10 000 cps, or eliminate electronics 
to do the job, how many cases will there ever be where 
you’re going to record anything over 8 or 10 000 cps in the 
home, regardless of whether the customer wants it or 
whether he doesn’t? If he wants it he can buy a profes- 
sional machine. 

SNYDER: It’s an extremely good question. The value of 
establishing a curve which does promote maximum suppres- 
sion of hiss, the value of establishing a curve which is iden- 
tical for home machines and for professional machines, is 
certainly great in an industry in which we have multiple 
media. We have competition among our own media, com- 
petition from television for that matter, so the value of hav- 
ing a standard is certainly great. Ed, would you add some- 
thing to what we have said? 

SorENSON: I think most of us who have been in the disc 
business can realize the similarity in some of our experi- 
ences in the past years, particularly with this trend to slower 
playing records and lower linear speeds. Looking back into 
history, we can see a parallel here between the standard 
that we are shooting for, a reproducing characteristic the 
most perfect possible, and, in the background, a restricted 
record curve which would meet all these requirements; i.e., 
following the same trend, but cut off somewhere around 
8 or 10 ke, and, if necessary, with even a little higher lift 
there, but still reproducible on the standard curve with 
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15-kc coverage. I think you'll find that there is a way of 
getting a record curve which is not necessarily stretched 
out to 15 ke but which will sound at least 90 or 95% 
equivalent, and, on top of that, with a lower degree of dis- 
tortion—which we probably would appreciate in a lower 
priced machine. I think you would find a lot of definite 
advantages in a compromise such as that. 

SnypDER: I am naturally attracted by the argument that 
you present. It certainly is true that the making of a good 
master tape at 71% ips, using the proposed NARTB stand- 
ard, using high quality microphones with good acoustic set- 
ting, and so on, requires most careful selection of levels, and 
I am sure nobody is better aware of how carefully the 7%- 
ips master needs to be made than Mr. Beaumont, Mr. Mil- 
tenburg or Mr. Sorenson, all of whom have prepared them. 
Perhaps it is asking a great deal of the semi-professional, 
the enthusiast or the rank amateur to take that steep pre- 
emphasis characteristic which we know is capable of ex- 
tremely good results, and in an uncontrolled acoustic set- 
ting, with a microphone of unknown high frequency peaks, 
to produce a tape which is free of pops and crackles which 
are due to momentary overloads, high-frequency, high- 
intensity signals. It therefore appears to me more attrac- 
tive than it did when I proposed it that we truncate the 
record curve on the semi-professional or amateur type of 
recorder, and yet preserve the playback curve, so that the 
professional recording will still playback on this excellent, 
if somewhat complicated, recorder. 

John, since you and I seem to be the center of such dis- 
agreement, as we are having tonight, please make a com- 
ment at this point! 

BEAUMONT: I have one random thought. I am young 
in the disc business but I seem to remember that there were 
many arguments over the NAB standard curve due to the 
amount of pre-emphasis—there was a great deal of dis- 
agreement in this country and in other countries, and in 
some ways, that to me, is somewhat the same as the 
NARTB standard as applied to 7% ips. We all agree that 
“the margin ain’t too great,” and we disagree to some degree 
as to the number of times program material contains high- 
frequency, high-energy material. There is an Ampex paper 
on this subject, in which the writer felt that there was 
adequate reserve in that curve. Some of us in the com- 
mercial recording practice wonder if we ran into troubles 
periodically from material which had sufficient energy con- 
tent to get us poor results at the high end, particularly at 
7%. The 15 seems to give adequate reserve for most pur- 
poses. I don’t want to enter that particular fight, yet this 
was one consideration that led us to feel that perhaps less 
pre-emphasis was saving us some trouble, and perhaps some 
distortion, in that upper part of the spectrum. It is im- 
possible of course, to consider the playback equalization as 
independent of its record predecessor. 

SNYDER: Unless you consider that there may be several 
record curves each ending in a different point on the same 
curve. 
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Beaumont: Yes, that’s one proposal that might work 
out. But this is only good for people who are willing to 
take the limitation. We cannot, in speaking for profes- 
sional practice only, consider more than one, I think. I 
was simply harkening back to the NAB controversy where 
they ran into the same problems. 


Snyper: I do feel called upon to make a comment on 
that. It is of course true that no one here is content with 
the recording methods he has. I assure you that I am not 
content with the machine we build or with the machines 
our very best competitors build. I trust that they agree 
better recorders will be built with wider dynamic range at 
the high and low end, and with wider frequency response, 
if that should ever appear necessary. It is nevertheless 
important to realize how close we stand to each of the en- 
closing extremes. To those professionals who have run into 
difficulty recording high-intensity, high-frequency sounds 
on 7% ips with the NARTB proposed curve, I would ask 
“how much oftener have you run into trouble with hiss?” 
These are the two limits between which we are caught, by 
the nature of the tape itself, and not by the nature of re- 
cording equipment. If we are running so close as to have 
trouble constantly with hiss, then it appears that we should 
be using more, not less pre-emphasis. If we are running 
much more often into trouble with splattering, due to the 
approach of saturation in tape, then we are running in the 
direction of too much pre-emphasis. I submit that the 


experience of most recordists, annoyed as they are when 
they run into splatter, is that hiss is the dominant problem, 


not splatter. Of the two, splatter is the more easily con- 
trolled, by using manual or frequency-sensitive peak limit- 
ers, or other professional devices. 

BEAUMONT: One must agree with you that hiss is our 
common problem, and to make matters worse, when you 
come to stereophonic recording the apparent hiss seems to 
rise out of proportion to any additive arrangement of two 
tracks. None of us, I am afraid, has successfully tackled 
that problem, and certainly in our business it is the worst 
one we face. It may not be entirely electrical, because you 
can measure the tracks independently and get a good noise 
figure, within the limits of the art at present. Yet, when 
you combine them in a room, the hiss rises to a much more 
objectionable level. 


Snyper: This, of course, is a problem which is going to 
plague us in preparing master tapes for stereodiscs, it’s a 
problem which has plagued us in preparing master tracks 
for stereophonic motion picture tracks, it’s one that is com- 
mon to all stereo. It indicates that our entire industry is 
in need of a recording method with still wider dynamic 
range and still greater signal-to-noise ratio. 


SorENSON: I think perhaps we have acoustical problems 
to solve too, because I do not believe the reduction of, say, 
2 or 3 db in the individual track hiss would make a very 
great subjective difference in the combined hiss. 


A STANDARD FOR LEVELS 


Snyper: Well, then let us consider that we have covered, 
if not settled, the question of tape equalization, may I ask 
for comments on how shall we determine precisely what is 
the absolute level of induction on the surface of the tape, 
that we settle the standard not on an absolute level in 
gausses or microgausses on the tape, but instead in a num- 
ber of tapes which have been most carefully compared for 
identity, one to the next, and decide that this is our stand- 
ard 250-cycle tone, and this is our standard 10 000-cycle 
tone. 

MILTENBURG: We have discussed this many times over 
lunch, and thought about it, and a standard tape is very 
much needed in the industry to pin this thing down once 
and for all so that we will have something to go on. 

Beaumont: As long as we don’t have to standardize 
signal-to-noise I see no objection. I think it sounds very 
logical, no reason why it cannot be done. 

Snyper: There is work in this direction now under way. 
A series of 13 such tapes were manufactured recently, very 
carefully compared for identity, and shipped to the Record 
Industry Association. These are being examined by its 
member companies. These have not been adopted as a 
standard, but submitted as a potential standard. If they 
meet with approval, it may be that we will have at least a 
point from which to depart, knowing in which direction we 
were going. 

BEAUMONT: May I ask if you can give a reference point 
for the electrical levels that are indicated on those tapes? 
What levels are revealed in them? 

Snyper: Everything is referred to 0.93% distortion. 

BEAUMONT: Since they are basing it on distortion, what 
does this come to on a + 4 or zero system? It comes up 
zero? You are still using that as nominal? 

SNYDER: Just as a point to depart from, because we knew 
exactly where it was. There is no reason to regard this as 
the level above which no commercial stereotapes should 
ever be recorded. It is simply a point to refer to, so that 
you might say that the maximum level on this tape is + 5 
whereas the maximum level on that tape is — 3, and every- 
body knows where you are. 

SorENSON: I do have a question. We know what line 
level we'll get, but what assurance have you that the acous- 
tic level will be anywhere near normal? 

Snyper: I can’t think of anyone more knowledgeable 
about that than you. 

SORENSON: Well, none of us in recording has ever 
reached an answer to that. But you will realize that the 
way a selection is produced, or the material in it will make 
one level bring the landlord up with a dispossess notice in 
his hands, while on the other one you will have to raise 
the gain, and your hiss problem will come back. 

Mittensurc: A standard tone does not necessarily mean 
a standard acoustic level. 

Snyper: No, nor was the standard tone which was placed 
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on those tapes, and on many other standard tapes, intended 
to give that answer. It was, however, intended to give a 
standard way of calibrating a standard meter. We are at 
least not dealing with unknown levels of distortion on peaks, 
unless we feel we may not know the ratio of peaks to aver- 
age levels. And we may not know, even after all these years 
of dealing with it, exactly what the VU indicator means 
when it momentarily reads zero. 


SoRENSON: We are coming into a new phase now, which 
means we have to more or less explore means other than 
standard tones. But by first getting a standard like that, I 
think that we would be able to get 90% of our problems out 
of the way. The other 10% we will have to live with until 
we learn to handle the first 90%, which would not be too 
rough. I think it’s a good start and I’m all for it. 


TRACK PLACEMENT 


SNYDER: Now about the business of track placement— 
it seems as if we are coming upon this subject at a time 
when it’s already settled. Yet, if there are voices to be 
raised on economic grounds or technical grounds, artistic 
grounds or other grounds, in behalf of the staggered stereo 
system I think we ought to have a chance to bring them up. 
John, do you know any reason why a manufacturer of com- 
mercially released stereotapes might be impelled in the fu- 
ture to continue to release staggered tape, or do you see the 
likelihood that all manufacturers will turn to in-line very 
soon? 


BEAUMONT: We feel that very naturally, particularly 
with the present spurt that has been given to it, and the 
availability of inexpensive in-line structures on home model 
machines, that the in-line standard will take its place in 
due course. As a manufacturer of recorded tapes, we have 
only this which we must say: our business began with 
staggered tapes at a time when there were no non-profes- 
sional in-line heads available. We must continue to manu- 
facture both in-line and staggered tape until our staggered 
business drops off to a point where we know there are not 
a significant number of customers to buy the product any 
more. Otherwise, I think we would be too soon cheating 
the many people who bought staggered machines. This is 
purely a manufacturer’s point of view, it is in no way quar- 
reling with the in-line standard which we heartily agree with. 

SNypDER: I see your point, John. Does it follow then 
that perhaps the easiest way to do it would be not to ad- 
vertise the staggered tape, or is that too delicate a question 
to ask? 


Beaumont: I don’t feel we must discriminate one way or 
the other, other people have taken stands, and people com- 
ing into the recording tape business in the last two years 
have a large enough market so that they can afford to take 
a stand one side or the other, and of course it’s been in-line. 
But we’re just going to beg off on the question for at least 
another year or two. When our volume drops, then we 
can begin to drop it bit by bit. 
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SPEAKER PLACEMENT 


SNyDER: I propose to open this discussion to the floor 
after one more question of the panel at large. 

If we are agreed that stereo information is most accu- 
rately described as angle information, does an angle of about 
60° appear to agree with studio monitor practice or is an 
angle smaller or larger than that in general usage? 


MILTENBURG: In our experience, it is usually a great 
deal less. 


SnypDER: Something like a 45° perhaps? 


MILTENBURG: Even less. 


BEAUMONT: You mean the included angle between the 
source of the two channels? 


MILTENBURG: Yes. 


BEAUMONT: Well, I have had relatively little stereo con- 
trol experience, since most of our work is done on location 
or in studios which set up their own monitor equipment. 
I would say that in general it depends upon the control— 
and I have seen some mighty queer sizes and shapes! We 
can’t always make a good solution of control. I have no 
real answer to that question. 

SorENSON: I don’t think we can answer that completely 
at the moment, because the angle is a peculiar one. In 
many cases you will find the angle as high as you men- 
tioned, or even very low. In many cases the two speakers 
would actually be beyond the wall! 


SNYDER: But there would be an apparent angle? 


SorENSON: Yes. There is a particular opening, aimed 
from the cone area out directly at you; you will probably 
find some place where they overlap. I think it is more or 
less dependent on the individual speaker geometry, as well 
as the enclosure you work with. I think that a more logical 
way to look at the problem is to establish a standard Jis- 
tening condition and not try to establish a standard re- 
cording system. 

SNYDER: It’s too early to hope for a solid, formally 
agreed upon listening situation for the preparation of stereo- 
phonic masters. Nevertheless, it appears there is a need 
for working toward that end—and with that in mind can 
you define, with any accuracy at all, Ed, what you would 
consider a reasonable standard environment for evaluating 
a stereo tape or stereo master? 

SorENSON: I was just thinking of Dr. Olson’s proposal.* 
Here we have a person well experienced both theoretically 
and practically. Might it not be good to use his proposal 
as a general starting point? 

SNypER: Mr. Miltenburg, would you suggest that Dr. 
Olson’s recommendations are a reasonable point from which 
we might depart? 


MILTENBURG: Yes, one good way to stay out of trouble. 


* Dr. Olson’s paper appears elsewhere in this issue. 
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QUESTIONS FROM THE FLOOR 


SNYDER: May we ask now for questions from the floor. 
Please direct your question to the person whom you would 
like to answer. I will attempt to repeat the question. 


Q: You mentioned recording in the home to 15 kc, and 
mentioned microphones of rather dubious characteristics, 
and room acoustics. Isn’t one of the major purposes of the 
home recorder to record FM programs or a good disc on 
tape? Here you have the complete frequency range avail- 
able, without the necessity of high class equipment. 


Snyper: The recording of live FM broadcasts is truly a 
function of importance to the purchaser of a tape recorder. 
My suggestion is only that we must anticipate having some- 
thing less than professional results from something less 
than a professional recorder, and it is exceedingly reason- 
able, indeed it is respectful of the customer’s best interests 
and greatest pleasure that if he is not in a position to possess 
a professional machine, that frequency range be the first 
point at which he limits himself. Surely he should not be 
limited by high distortion. nor in signal-to-noise ratio, which 
would cause his immediate discomfort. If he cannot obtain 
all of the desirables, the first thing to sacrifice would be 
frequency range. This is the most reasonable place to make 
an economic reduction in performance. 


Q: I would like to suggest two more things that should 
be standardized: (1) I believe it is the industry practice 
now, rather than the standard, that the upper track is fed 
to the left-hand speaker. (2) I believe you didn’t quite 
make your position clear on this angle subtended by the 
listening position. I think what you meant to say was, that 
a certain angle should be included between the listener and 
the speaker. 

SNYDER: On item 1, a valuable addition indeed to the list 
of things on which agreement can surely be reached. And 
on the second point, my suggestion is that in any given 
listening environment it is possible to arrange the loud- 
speakers so that a tone in one appears to be displaced with 
respect to a tone from the other by an angle which is meas- 
urable. This angle, I feel, might be standardized. 


Q: Ross, do you propose to put the measurement of the 
apparent angle between the observer and the two speakers 
on a geometrical basis or on the basis of apparent sound? 

SNYDER: Geometrical. 

SorENSON: I would like to break here because many of 
us have been up against the problem, and it doesn’t seem 
easy with a steady tone. Has anyone thought of a sweep 
frequency—or a wobble tone? I think we have all tried 
it, and on stereo, have had no trouble getting perspective 
or separation. I think you find it much easier. It is a very 
simple practice, and gives the engineer, with his tones, the 
closest approach to a modulated or studio signal. 

Snyper: It sounds like a valuable refinement. 

Beaumont: So far as localization is concerned, I think 


clicks, perhaps short transients, are the easiest to localize; 
at least that has been our experience in listening stereo- 
phonically. Lacking microphone equipment, this could be 
readily produced in the home by using a selected passage 
from a tape. 

Q: I have not so much a question as a comment or two 
directed at the entire panel. I am concerned, as an engi- 
neer, at the potential harm being done to the industry by 
the release of stereo tapes, because it is my feeling that 
staggered tapes are being played back with little or no 
precision, and, as you have pointed out earlier, a 1-msec or 
1-mil error in the relative gap position starts to introduce 
degradation. We do find the public buying such machines 
and tapes and as a result being disappointed in stereo. I 
personally have heard many people say that “I am dis- 
appointed in stereo—it doesn’t do anything for me—what 
am I doing wrong—what’s wrong with this tape, and so on.” 
So that I congratulate the companies that take the stand 
that their material will not be released in any form but 
in-line. Even that has its problems. As a matter of fact, 
some people tell me that they are disappointed with stereo 
in-line tapes. This arises from the fact that consumers 
slide speakers around on the floor, sometimes 4 feet apart, 
facing outward, sometimes in a corner facing parallel with 
the wall, and sometimes facing inward, depending on which 
tape is playing; all in order for it to sound right, a situa- 
tion which I hope in the next short while is overcome. 

Q: I would like to direct my information to both the 
panel and the last questioner. In calculating the time delay 
that you would get by offsetting the stagger as much as 5 
mils at 174-ips tape, this only amounts to 2.7 msec. Now I 
defy anyone to determine a time delay of this type. At 
7% ips, you would have a quarter of that time delay. 

Snyper: I appreciate the basis upon which the comment 
is made, but I believe, nevertheless, I can establish for you 
that the human hearing mechanism is perfectly capable of 
distinguishing a shift in a 10-kc tone in which the shift 
which you just discussed would be something like 40° of 
motion. The shift of a 10-kc tone, which is produced by a 
1.5-mil displacement on a 7'2-ips tape, is approximately 18 
lateral degrees of apparent position. The shift is not so 
prominent, of course, in a 1 000-cps tone, where it is only 
1/10 as much. But the shift of the 10-kc tone involves 
many of the overtones of the important instruments upon 
which localization is important. Such shift in overtones 
with respect to fundamentals causes mutilation of the 
stereophonic material, and I would like very much to have 
the opportunity to mutilate some material in the laboratory 
for you, as we have mutilated it in determining that we 
must not tolerate more than 1-mil displacement error. 

SorENSON: Our pragmatic experience says that the small- 
est displacements can be more easily determined in speech. 
Whether or not this is due to overtone structure, I don’t 
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know, but music as recorded and played these days covers 
many sins unless you are listening very critically. How- 
ever, in speech you can tell displacements in the order of 
certainly a couple of mils. 

SNYDER: It’s interesting that speech is the most critical 
material which many of us had to deal with. Everybody 
hears people talking alive, every day; very few of us have 
the opportunity to hear music live everyday. We are, per- 
haps, therefore, more sensitive to small distortions in speech 
than we are to small distortions in music. 

SorENSON: In that vein, we encounter the effect many 
times in our duplicators. I am sure you realize that we 
duplicate on two heads displaced x number of inches. The 
stretch between the tape and the heads is just the nominal 
stretch of the tape as it passes across the heads to produce 
in-line stereotape. I am sure this is on the order of a mil, 
maybe even more, but it will create a synchronization prob- 
lem. We have had many, many tapes rejected for this 
condition. 


Q: Is it possible that tape stretch causes displacement to 
develop? 

A: This is entirely possible if the tape has a tendency to 
either be badly slit giving a wrinkled effect on the top, or 
have a slight stretch as it is unwound causing track dis- 
placement, rather than alignment. 

Snyper: If it is in alignment, it is not likely under the 
worst circumstances to produce so much displacement as a 
mil. The displacement of the top gap with respect to the 


bottom which will produce a 4-db drop in 15 kc at 7% ips 


is measured in micro-inches, not mils. I do not believe 
azimuth misalignment from top to bottom of a tape is going 
to produce the sort of effect that we are discussing. 


Q: Wouldn’t misalignment bring you here a reduction of 
high-frequency signal, which you may misinterpret as phase 
differences? 

SNYDER: It may be, but this hasn’t been our experience. 
It is not from loss of high-frequency that you get a definite 
staggered effect. Quite a different mutilation of the stereo- 
phonic effect is produced when the entire gap is displaced 
longitudinally with respect to the other gap, and this is the 
effect to which I first had reference when I said that I could 
establish, I believe most interestingly to your ears and to 
those of most anyone, that a 2-mil shift would produce 
mutilation of the stereophonic effect; indeed, produce, what 
in the motion picture industry laboratories they usually 
call “swimming” of the strings. 

Q: Before I direct my original question to the panel, 
may I comment to the gentleman here. We recently were 
asked to make a master for staggered type heads. We had 
no equipment to do it so had to set up the standards our- 
selves. In checking, I found there was as much as %” 
difference between various manufacturers in the head place- 
ment. Obviously, we didn’t know where to go, so we picked 
out one manufacturer, of semi-professional machines, I 
think, the first one that came out with stereo, and we fol- 
lowed as closely as we could his head placement. Voices 
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and solo instruments are often put on two tracks, and, if 
there is any displacement at all, with very noticeable dif- 
ference in the phase they start fighting back and forth. 
This is so obvious that anybody can hear it. If you use 
one mike and spread it out like most stereo is being done 
today for solo work, the accuracy of the head placement 
should be as close as possible. In fact, I know two of the 
panel members and myself have all run into this problem 
with 3-track recorders. It’s only a matter of a mil apart, I 
am sure, between the upper and the lower track. So it is 
a very important thing. 

Snyper: I would be willing to say at this point I wish 
it were much less important. We’d have fewer gray hairs 
developed. 

Q. (Mr. Harry Bryant): I think we’re missing a point 
there on signal-to-noise ratio. This is extremely important. 
Everyone is assuming it’s high-frequency. It has been my 
observation that it covers a much wider frequency range 
than is generally accepted. The low-frequency noise is 
quite often disturbing too. Frankly I have made a few 
checks along this line and found that an increase in the 
overall recording level of 2 db results in as much signal- 
to-noise improvement as when reducing 4 db at 10000 cps. 
Of all the techniques being used today, with higher and 
higher intensities in the upper frequency range, we are ap- 
proaching a point of considerable distortion, in fact we’re 
in trouble. Occasionally, we have trouble now at 15 ips, 
and 7% splatters all over the place, if the recording level 
is kept up, especially using a Telefunken mike. Everyone 
is conveniently overlooking this, trying to heat up the 
highs, to improve the signal-to-noise. I think we have gone 
too far, to be frank with you. I would like to see a common- 
sense approach to it. I would reduce the curve in the top 
end, if we can increase the overall recording level a little bit. 


Snyper: I think probably Mr. Harry Bryant has just 
revealed 2 or 3 of the secrets which have made recordings 
from his studio among the most successful and most widely 
envied in our industry. I comment to this extent: We have 
made statistical studies of typical recorded materials. You 
are among the people who have been kind enough to lend 
us your masters from time to time, so partially through 
studying your masters we know that the typical scope of 
distribution of energy in recordings, particularly popular 
recordings, have trended during the past seven years always 
toward increasing content in the 8-kc region. There is, so 
far as our purely statistical studies can show, no tendency 
for the spectral distribution to show an increase in the 
energy above 8 kc. This indicates that our industry is 
tending toward a need for greater dynamic range in the 4- 
to 8-kc range, though perhaps not in the 8- to 16-kc octave. 
I suggest there has been not nearly enough work done on 
it. Perhaps this does not indicate so much a need for a 
change in the shape of the recording curve as it does for 
a change in the sensitivity curve of the magnetic medium. 
There is work being done in this direction. The same prob- 
lem is also evident in disc recordings. 
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SorENSON: Yes, the same problem: We have too much 
energy in the upper spectrum, and the program material is 
hot on highs; everybody is on the hi-fi kick, they want 
more and more hi, but they’ve forgotten one thing. The 
material the records are being pressed from is much better 
than 10 years ago, when the original curve was established. 
The masters are better and consequently we can use more 
highs in the playback system. Of course, the standards 
are better. Anyway, there’s nothing you can do about it 
but live with them now. 

SnypER: Some most responsible people in the disc side 
of our industry suggest that the time may be here when a 
reduction in playback de-emphasis may be appropriate in 
the disc business. I have no comment to make upon that. 

SorENSON: Our trend of thought has brought us to the 
point where no one seems to dare go any further on tape. 
Has any thought been given to “high-output” tape to meet 
some of these requirements? Is the cost factor so great 
that you couldn’t use that to gain something? 

A. (from floor): Most people overlook the fact that in 
disc recording you can actually record levels at 15 ke 
equivalent to the maximum at 1000 cps on 7'%-ips tape. 
We can only record on tape at 15 ke at a level which is 15 
to 20 db below the level at 1 kc. The problem of over- 
loading is approximately 15 db worse on tape than it is on 
disc, and it has been my experience that most release tapes 
have a rather large amount of “pucks” and similar noises, 
and occasional overloads of high-frequency. So I would 
not be very happy on standardizing on a curve that is so 
hot on top. 

Snyper: I am certain that none of the tapes to which 
you have reference were made by either of the three firms 
represented here, therefore we may speak freely. Those 
effects are due to inadequate level control during recording. 
I presume that in saying you can record at 15 kc a tone of 
the same absolute output level as the 1-kc tape tone, you 
were talking about an acetate master without pre-emphasis, 
and you were talking about the outer grooves of the disc, 
may I ask if this is so? 

A. (from floor): That is correct, but I was talking of the 
commercial practice using the RIAA recording character- 
istics. 

SNYDER: I would like it explained to me how it is that 
a pre-emphasis system so favors the high end that we can 
place a higher absolute level on the disc at 15 ke than on 
a tape. I regret to say that my knowledge of this subject 
does not permit me to make a further comment. I would 
very much like other members of the panel, all of whom 
have far more experience in this than I do, to comment 
upon it. (No answer.) That leaves me with complete in- 
ability to reply! 

Q. (Col. Richard Ranger): I am very interested in what’s 
going on here. There are some points here that I’d like 
to get at a little more definitely. 

(1) We are interested in signal-to-noise ratios, but after 
all the main question is what the listener will get out of 
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the recording. In other words, is it pleasing? I wish we 
could have some kind of instrument that could determine 
that. We had a wonderful demonstration this afternoon 
of stereophonic sound, but to many of us it was entirely 
too loud and I think that you have explained that in telling 
us what you had in your control room. It may well be nec- 
essary but it certainly wasn’t comfortable for the people 
here. 

(2) My work has been for the last few years, chiefly, in 
motion picture sound. There we always look for 6000 
cycles. If you get good clear 6 000 cycles, you’re in busi- 
ness and I would suggest to you all that you listen, without 
any instruments, to 6 000 cycles on your equipment. 1 have 
reached the stage where I can’t hear 15 kc to know whether 
it’s really good or not, but try 6, won’t you? There is a 
whale of a difference between different machines on 6 000 
cycles. 

(3) Now about this phase business: Our ears are so 
built that we physically cannot tell phase differences of the 
orders of which you are speaking at high-frequencies. You 
can tell intensity differences, but not phase differences. 1 
kc is the cross-over point. Below 1 kc you respond to phase 
differences, and I agree with you that in speech it is most 
important, but it is intensity, entirely, above 1 kc. 

SnypDER: Would you suggest, Col. Ranger, that very 
small shifts in phase affect tones below 1 000 cycles audibly 
then? 

RANGER: Yes, they do. They definitely do. 

Snyper: Colonel, you speak with authority on this sub- 
ject. 

Q: Col. Ranger, isn’t it true that almost all manufacturers 
of motion picture equipment are now using a 10-kc low-pass 
filter in magnetic recording? 

RANGER: Yes. 

Q: Isn’t it true also that all dialogue has been recorded 
for the past 20 years using low passes at 6750 cycles? 
Music at 8500 cycles? 

RANGER: Correct. 

Q: Where did the 6 000 cycles come from? 

RANGER: I am just asking you to listen to it. Get your 
machines to sound good, and I think that just automatically 
you will improve your operation. 

MittensurG: Col. Ranger, you were talking about the 
level of the stereophonic demonstration this afternoon. I 
believe that you will find levels of this magnitude quite 
average for theatre type installations. 

RANGER: Yes—but the audience today was a lot closer! 

MILTENBURG: This may be possible. We were trying to 
accomplish something today. 

RANGER: You did! (Laughter.) 

RANGER: Now may I ask the panel. Another question 
that I would like to have settled, or at least propose the 
settlement for, is the size of the track, the position and the 
spacing. 

Snyper: This would be a track of nominally 100 mils, 
which might be permitted to vary downward as far as per- 
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haps 85. It might be of value to permit the track to 
become as narrow as 85 to permit inter-track separation for 
high crosstalk rejection. 

RaNcER: Isn’t it important in stereophonic work to set 
a pretty definite limit? 

Snyper: I think it might be extremely important that 
we specify that the heads be capable of covering the outer 
85 mils. 

RANGER: Do you go to the edge of the 4” tape? 

Snyper: Yes, in recording. 

RANGER: Well, that’s very good, but have we a standard 
in that direction? 

Beaumont: The MRIA is working towards a proposed 
standard for stereo track width which allows some 90 mils 
of separation plus some 15 mils of separation to the edge. 
There is some question about this still, since Ampex, at 
least, feels that no more than 80 is necessary for required 
rejection. We've gotten at least that far. 

Snyper: Colonel, I would like to interpolate a comment 
about levels—I have been responsible for many demon- 
strations, both stereo and otherwise. My standard for 
determining when the level is right is to find that loudness 
at which half the people say it’s too loud and half of them 
say it’s not loud enough. 

Voice: We use the theory that when your eyeballs light 
up you know it’s loud enough. I would like to elaborate 
on what John Beaumont said on the MRIA establishing 
standards. They’ve gone as far as to establish standard 
reels, both in the reel width, the hub, diameter, outside 
diameter, the track spacing on full track; half track, and 
stereo, and established azimuth accuracy of the two chan- 
nels on stereo, the position of the heads for half-track, and 
have initiated a new committee for test measurements which 
we felt was very important because everyone was in agree- 
ment as to how to establish these. John, of course, you had 
one on equalization, and I think they covered all the things 
that you mentioned, Ross, except speaker placement and 
reproducing level. I think it might be worthwhile to com- 
bine these two into one standard and present this to the 
industry, since MRIA is composed of nine manufacturers 
of commercial tape recorders. These are the machines 
people will use with pre-recorded tapes. 

SNYDER: It is because Mr. Beaumont is a member of 
the AES Standards Committee and the MRIA Standards 
Committee that he was invited to join our panel tonight. 
What you suggested is our aim. John, would you make an 
additional comment? 

BEAUMONT: I’m simply going to say that it’s been a 
source of sorrow for me that the Society was not first in 
proposing standards. I think it is an excellent idea now 
that at least one body is under way and we can now com- 
bine our efforts and work towards as much standardization 
as we can get in these matters at least. 

Snyper: I add, of course, that the RIAA must be 
brought into consideration, must be fully advised of all of 
the findings and recommendations of the bodies represent- 


ing AES and MRIA. After all, the RIAA represents most 
of the people who will be manufacturing the tapes which 
are released and the stereo discs which are later released, 
and in which we have many things in common. 

Q: To return briefly to the subject of pre-emphasis, I 
think it might be interesting to hark back to the old NAB 
equalization controversy. I think that controversy would 
have been along slightly different lines had we had at the 
time better pickups to play the records. We now know 
that if we go back to records which 10 years ago seemed 
overly pre-emphasized, and play them with the modern 
transcription pickup, we find they weren’t nearly so bad 
as we thought they were. I wonder if you gentlemen would 
consider that future changes in tape and possibly in play- 
back equipment might make pre-emphasis greater than now 
adopted more bearable in the future. 

Snyper: There is always the danger in establishing 
standards that we would establish them at a point at which 
progress would be limited rather than promoted. Of course, 
no standard which is set is utterly unshakable. The posi- 
tion I have taken tonight is that we should positively not 
move in the direction of less pre-emphasis and that we 
might consider moving in the direction of more pre-emphasis 
of the top end. This is not a position that everybody in 
the industry supports, however. I have given my reasons 
for supporting it. Mr. Beaumont very ably defended the 
alternatives to that position. 

Q. (Gordon Murphy): Ym not currently concerned in 
any way with the engineering aspects of this problem, but 
I am trying to sell pre-recorded tape. I must agree that 
the hiss problem just absolutely overwhelms all others in 
connection with attempting to sell to the general public. I 
have dealt with hi-fi fans to the sophisticated type and I 
have found them exceedingly unreceptive to the signal-to- 
noise ratio which they are given, in respect to hiss pri- 
marily. Particularly with respect to women and hiss, it’s 
been quite disastrous from the sales point of view. 

Voice (from floor): Most of the hiss just referred to was 
on reproduced copies. While it is a factor in the original 
recording, it is not quite as bad, apparently. It is on copies 
which have been dubbed 3 or 4 times from the original, 
that the noise is bad. 

SNYDER: We have some studies which indicate that if 
all conditions are most carefully controlled it should be 
possible to make several generations with increase in noise 
not exceeding 2 db average. Too often, I have found that 
the customer, who complains bitterly about hiss has a tape 
which has been run over a magnetized head and he’s most 
reluctant, as I would be in his position, to admit it was his 
own carelessness which caused the tape to become noisy. 
We must guard ourselves against assuming that all com- 
plaints of hiss are based upon the irreducible minimum of 
hiss which we can achieve. 

SorRENSON: I might comment that in our manufacturing 
experience, using Ampex high speed duplicators, we can 
stay within the 2-db degradation in the duplicator cycle; 
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something which surprised us until we measured it. 
MILTENBURG: That’s been our experience too. 
BEAUMONT: 2 db, sometimes less, and sometimes slightly 
more. Certainly, it requires careful quality control. 
Snyper: If this is achieved, you may be certain that 
the company achieving it is carefully controlling every step 
of the operation. 
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With that, then, we will bring this Symposium to a close. 
We thank you all for staying up until this late hour near 
the end of the Convention, and we’ll see you all again next 
year, when we may be able to report that some of these 
standards are standards in fact rather than proposals. 
Thank you kindly. 


Letter to the Editor 


Short contributions which, by reason of timely interest and/or controversial nature, should be published immediately 
without going through the usual review channels will be published in this section. We hope others will be encouraged to 


submit similar contributions. 


Subjective Effects of Frequency 
Modulation Distortion* 


Pau W. KuipscH 
Klipsch and Associates, Hope, Arkansas 


INTRODUCTION 


HIS IS AN evaluation of a definite amount of frequency 
modulation distortion, and its relation to speaker output 
and subjective response. A signal was frequency modulated 
to the extent of 0.35% rms; the modulating frequency was 
10, 20 and 40 cps. The modulated signal was either sinu- 
soidal or with steep sides and bulged tops approaching a 
square wave. 
Listening tests for subjective evaluation of this distor- 
tion indicated the amount was intolerable. 


METHOD 


A thimble for a tape machine capstan was made by care- 
fully boring the sleeve to capstan size, offsetting the 4-way 
independent chuck in the lathe exactly 0.0014” and then 
turning the outside diameter to 0.280”. Slipping this thim- 
ble over the capstan results in a 1% peak-to-peak flutter, 
or 0.5% peak flutter. Expressed as a rms value the flutter 
is 0.35%. 

Capstan speeds available were 10 and 20 cps, and by 
copying the tape from 15 to 7% ips and playing back at 
15 ips, a flutter frequency of 40 cps is obtainable. 


RESULTS 


A sine wave of approximately 1046 cps exhibits a fast 
“vibrato” at 10 cycles flutter. The effect at 20 cycles be- 
comes unpleasant, and at 40 cycles extremely irritating. 

A wave with the steep sides of a square wave but with 
tops rounded exhibits an oboe-like tone. A 10-cycle flutter 
of this sound has no counterpart in music, though the effect 
is not particularly unpleasant, but at 20 and 40 cycles the 
irritating quality is multiplied over that of frequency modu- 
lation of a sine wave. 


* Received April 3, 1958. 


SIGNIFICANCE 


The method of creating the distortion is such as to avoid 
other forms of distortion. That is, frequency modulation 
is the only appreciable distortion, and is unaccompanied by 
amplitude modulation. The wave form depicted on the os- 
cillograph is clean and devoid of any trace of amplitude 
modulation. However, played through a speaker and micro- 
phone, the oscillograph shows amplitude modulation due to 
the standing wave interferences. Presumably the auditory 
sense also “discriminates” to form amplitude modulation 
from the pure frequency modulation. But the point is, here 
is a method to produce pure frequency modulation distor- 
tion without other forms of distortion, such as would take 
place in a perfect, distortion-free speaker producing two 
tones simultaneously. 

It serves as an evaluation of the severity of amounts of 
frequency modulation as low as 0.35%. 

This amount of distortion would be produced by a 12-inch 
loudspeaker (10-inch cone) performing an amplitude of 
0.21” at 50 cps. 

To determine subjective limits of “tolerable,” “permissi- 
ble” or “irritating” distortion levels would require capstans 
of varying degrees of eccentricity and different frequencies 
of rotation. 

From the present available data the conclusions are: 

1. 0.35% frequency modulation distortion is increasingly 

irritating as the modulating frequency is increased. 

2. At low modulating frequencies the wow is beyond 

tolerable levels. 

3. At medium modulating frequencies (5-10 cycles) the 

result is introduction of artificial “vibrato.” 

4. At higher frequencies the quality becomes harsh as the 

modulating frequency is raised. 

Translation of the results into speaker output and the 
resultant subjective effects can be direct. However, attempts 
to correlate the two always result in the speaker sounding 
worse than the pure or synthetic distortion, because the 
speaker frequency modulation distortion is always accom- 
panied by amplitude modulation distortion, the same causes 
usually contributing to both. 
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TV musical background—wide flexibility! 


2 instead of 10,000 cubic feet - compact! 
Leading users of various models: Movietonews, National Film Board, WCFL, WFIL-TV, 
WGBH-TV, KCMO, WSB-TV, Australian Broadcasting System, Belgian Broadcasting Sys- 
tem, Canadian Broadcasting Corp., Decca Records, Dumont, Howell Recording Studio, 


RCA Mexicana, Olmsted Sound Studios, Pathe-Marconi, RCA Far East, RCA Victor, Radio 
Free Europe, Radio Recorders. 


JUST RELEASED ! 
New Model 44A 


features 9 heads, individual head switching and sliding heads for 
infinite flexibility, variable equalization for special effects. Net 
price—$1485. 


pays for itself in 3 to 5 months 


leading manufacturers of quality audio equipment use 


« 
ail intermodulation meters 


Flexibility, low cost, exceptionally low residual intermodulation (leakage). 


Used by such leading equipment manufacturers as Admiral, Ampro, CBC, Boeing, Bogen, 
DuMont, Fairchild, GE, Langevin, Marantz, Radio Craftsmen, RCA, Seeburg, Sonotone, 
Standard Transformer, Stromberg-Carlson. 


Do you know about our new 
Model 168, just released? 


Write for new Catalog A-1 @ | | he ‘ 
audio instrument co., inc. 


/ 133 West 14 Street, New York 11, New York 
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Look... only the Hlxvalve 
has 100% 


Choice of the best phonograph pickup can only be resolved by comparison! 
What is the yardstick? . .. How can you tell? . .. What do you look for?.. . 
the answer is 100% Important Quality Features*! 


PICKERING has had long experience in the cartridge field, supplying the 
finest quality products for recording studios, broadcast stations, wired music 
services, and high fidelity home music systems. As a result of this extensive 
experience, PICKERING has developed the FLUXVALVE .. . the one car- 
tridge which incorporates all of the Important Quality Features* so 
necessary for high fidelity reproduction from records. 


DECIBELS 


Before you choose a cartridge... : | 
LISTEN AND COMPARE... demand 100% IQF*! MREQUENCY IN CYCLES PER SECOND 
CARTRIDGE A 


Cartridge 


ali Be 


ts 6 
na § 
Is EUAN? adhe 


Frequency Response: Flat | 
20-20,000 cps + 2 db 
(see curves on right) 


YES 20 Points 


NO 0 Points 


NO 0 Points 


NO 0 Points 


Low Tracking Force, 
2-4 grams 


YES 20 Points 


NO 0 Points 


YES 20 Points 


NO 0 Points 


High Output, No 
Transformer Required 


YES 10 Points 


NO 0 Points 


NO 0 Points 


YES 10 Points 


Replaceable Styli 


YES 10 Points 


NO 0 Points 


NO 0 Points 


YES 10 Points 


Y2 Mil Stylus 


YES 15 Points 


NO 0 Points 


NO 0 Points 


NO 0 Points 


One Cartridge For 
LP’s and 78's 


YES 5 Points 


NO 0 Points 


NO 0 Points 


YES 5 Points 


Anti-Hum Design 


YES 10 Points 


YES 10 Points 


YES 10 Points 


YES 10 Points 


Hermetically Sealed 
TOTAL POINT VALUE 


YES 10 Points 
100% 


NO 0 Points 
10% 


NO 0 Points 
30% 


NO 0 Points 
35% 


: 


DECIBELS 


FREQUENCY IN CYCLES PER SECOND 


pecrerts 


% 


IMPORTANT 
QUALITY 


FEATURES 


—so 

THE FLUXVALVE...chosen time and again as the top cartridge solely on the necessary 
basis of listening quality . .. by panels of qualified experts . . . tests which have eS a 1 BTU t 
proven that it is actually less costly to own a FLUXVALVE Coc con val 

The FLUXVALVE preserves the quality and prolongs the life of your record LE EHP Heth high 
since there is complete absence of resonances throughout the audio frequency FREQUENCY IN CYCLES PER SECOND fidelity 
range. 

It may interest you to know that the FLUXVALVE, because of its ability to reproduction 
make precise and reproducible record measurements, is used for calibrating from 
records. 


recording channels and record masters. 


Peaks and/or resonances in the stylus 
assembly at any recorded frequency will 
distort; and, damage the record groove. 
Therefore, any deviation from flat response 
over the recorded frequency band results 
in eventual breakdown of the groove wall. 
Deviations of from 3-6 db distort the rec- 
ord material as much as 60-100%. 


Make the IQF* test today ... listen to your favorite record 
reproduced with a FLUXVALVE ...the gentle pickup. 


Series 1940 UNIPOISE Pickup Arm 
with FLUXVALVE Cartridge. 


Model 194D with 1 mil diamond “T-Guard”™ stylus $59.85 
Model 194D with 2.7 mil diamond “T-Guard” stylus 59.85 
Mode! 194 5D with ¥2 mil diamond “T-Guard™ stylus 65.85 


SERIES 370 SINGLE FLUXVALVE 
1 mil Sapphire 17.85 
2.7 mil Sapphire 17.85 
1 mil Diamond 29.85 
2.7 mil Diamond 29.85 
¥Y2 mil Diamond 35.85 


SERIES 350 TWIN FLUXVALVE™ 
350-00 1 mil Diamond 
350-.5D0 2 mil Diamond 
350-DS 1 mil Diamond /2.7 mil Sapphire 
350-00 1 mil Diamond /2.7 mil Diamond 
350-0.50 1 mi! Diamond 2 mil Diamond 


SERIES 3500 “T-Guard” STYLI** 
$-3510 1 mil Sapphire 
$-3527 2.7 mil Sapphire 
0-3510 1 mil Diamond 
0-3527 2.7 mil Diamond 
0-3505 2 mil Diamond 


**Other stylus radii available on special order 


for those who con the UUELENCE  eme quaury men eweury pRooucts ey 
PICKERING & COMPANY, INC., Plainview, N. Y. 


AT LEADING HIGH FIDELITY SHOPS EVERYWHERE. WRITE FOR NAME OF NEAREST DEALER TO DEPT. S-48. 


“Available in many other combinations of styli 
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TEST RESULTS 


ranscription turntabl 


TESTED: for performance by Audio Instrument Company, Inc., an independent laboratory. 


rResucts: Garrard Model 301 tested even better than most professional 
disc recording turntables...sets a new standard for transcription machines! 


gy Read Mr. LeBel’s report below g 


Gentlemen: 

We have tested the three 
Garrard Model 301 Turntables 
which the undersigned selected at random from sealed unopened 
cartons in your warehouse stock. These three bore the following 
serial numbers: 867, 937, 3019. We used a standard Model WB-301 
mounting base without modification, a Leak tone arm fitted with 
their LP cartridge, and a complete Leak preamplifier and power 
amplifier, model TL/10. 

Pickup and amplifier system conformed in response to the 
RIAA-new AES-new NARTB curve within + 1 db. 

Standards referred to below are sections of the latest edition, 
National Association of Radio & Television Broadcasters Recording 
and Reproducing Standards. Our conclusions are as follows: 


3 Stock machines 
selected at random! 


P : Measurements were made in 

cymenaenes * protec accordance with NARTB spe- 

cification 1.05.01, using a stro- 

boscope disc. In every case, speed could be adjusted to be in 

compliance with section 1.05, i.e. within 0.3%. In fact, it could 
easily be adjusted to be exactly correct. 


WOW less than Measurements were made at 

NARTB specifications! 33% rpm in accordance with 

NARTB specification 1.11, 

which calls for not over 0.20% deviation. These values substantially 

agreed with those given on Garrard’s individual test sheets which 
are included with each motor. 


Garrard Serial No. % 
867 17 

.13 

12 


Rumble less than Measurements were made in 


most professional accordance with sections 1.12 

. 1 and 1.12.01, using a 10 to 250 

recording turntables! ome tend oun He, ond 6 40 

meter for indication. Attenuation was the specified 12 db per 

octave above 500 cps and 6 db per octave below 10 cps. Speed 
was 33% rpm. 


C4 
BLC. Now there’s a Garrard 
a= et for every high-fidelity system 
5s yy 3 Qs. 


Lo Ress aci2zyn 301 
Super Changer Deluxe Changer Mixer Changer juratable 
967.50 $54.50 $42.50 "yon00 


Signal to Rumble Ratio Using 
Reference Velocity of 7 cm/sec 
at 500 cps 

This reference velocity cor+ 
responds to the NARTB value 
of 1.4 cm/sec at 100 cps. 


Rumble: checked by 
official NARTB standard 
method 


(—35 db. min.) 
—52 db.! 


Garrard Serial No. DB 
867 52 

937 49 

3019 49 


The results shown are all better than the 35 db broadcast repro- 
ducing turntable minimum set by NARTB section 1.12. In fact they 
are better than most professional disc recording turntables. 


Signal to Rumble Ratio Using : 
Reference Velocity of 20 cm/sec eee tll gm by 


s 
at 500 cps methods —61 db.! 


Garrard Serial No. 
867 

937 

3019 


We include this second table 
to facilitate comparison because Rumble: omy by 


Manuf Saas 

some turntable manufacturers 

have used their own non-stand- [MaAbeAibwerd —84.1 db.! 

ard reference velocity of 20 

cm/sec, at an unstated frequency. If this 20 cm/sec were taken at 

100 cps instead, we would add an additional 23.1 db to the figures 

just above. This would then show serial number 867 to be 84.1 db. 
It will be seen from the above 


that no rumble figures are ae cae importance! 
meaningful unless related to the these 
reference velocity and the ref- [idghieMibehi URS evaluate 
erence frequency. Furthermore, [MRedeAMbUDMe aD eae LE 
as stated in NARTB specifica- 
tion 1.12.01, results depend on the equalizer and pickup character- 
istics, as well as on the turntable itself. Thus, it is further necessary 
to indicate, as we have done, the components used in making the 
test. For example, a preamplifier with extremely poor low frequency 
Tesponse would appear to wipe out all rumble and lead to the erro- 
neous conclusion that the turntable is better than it actually is. 
One other factor to consider is the method by which the turntable 
is mounted when the test is made. That is why our tests were made 
on an ordinary mounting base available to the consumer. 


CT. Cob 


AUDIO INSTRUMENT COMPANY, INC. Cu. LeBel 


Write for free Illustrated Literature. Dept. GD-478, Garrard Sales Corp., Port Washington, N. Y. 
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A Special Invitation to Members of the 
AUDIO ENGINEERING SOCIETY 


The next three monthly issues of High Fidelity for only $1 
- + . saves you 44% of the regular single copy price! 


A good percentage of the members of the Audio Engineering Society are already 
HIGH FIDELITY subscribers. But if you are not now receiving copies regu- 
larly .. . if HIGH FIDELITY is, perhaps, just another name on the sustain- 
ing list of the Society ... this is your opportunity to get acquainted with 
the world’s first and leading hi-fi magazine at a considerable saving. In your 
three trial issues, you will enjoy: 


* ‘'TESTED-IN-THE-HOME-REPORTS .. . Covering everything 
from a cartridge to a complex speaker system. You'll find all the 
manufacturer’s specifications included, too. 


PLUS 


* EQUIPMENT IN THE MAKING ... Exclusive previews of 
new devices being developed in the laboratories — devices of in- * 


More record coverage (LP’s and tapes) than any other magazine 
terest and importance to everyone in the audio field. 


in America: classical, popular, jazz, the spoken word, demon- 
strators, etc. 

NEWS FROM ABROAD .. . Progress reports on what’s happen- 

ing in the record and sound laboratories of Europe that will affect * 


Behind-the-scenes close-ups of important figures in the music and 
audiophiles in the U. S. 


audio fields. 


* THE FUTURE OF STEREO .. . The latest details on the de- bg 


Special discographies listing and ranking all the LP recordings of 
velopment of stereo equipment, tape and disc. 


a single composer or artist. 


THERE’S NO BETTER TIME TO BE READING HIGH FIDELITY THAN NOW, WHEN SO MANY 
IMPORTANT NEW DEVELOPMENTS ARE TAKING PLACE IN THE FIELD OF SOUND REPRODUCTION. 


= 
I 
! 


| SPECIAL $1 TRIAL OFFER 

Please send me the next three issues of HIGH FIDELITY, for which I am 

! enclosing $1.00. (Regular single copy price: $1.80) 
FILL OUT | ie tae pon ee 
THIS Sp ee ieierter rr 
oo re em La ae 


TODAY! 


Mail to: HIGH FIDELITY, 20 Publishing House, Great Barrington, Mass. 


el 


‘J 
ee i 
z 
* 
Yr. 
t 
ee : 
A 
q 
- 
BP | 
BO 
CNet eee : 
ee 
———— OO —“C:isSCSC‘CSCiés 
‘ 


ey ae eer ene ye 


Gin, Gai ha 


Be sure to mention the JoURNAL in replying to our advertisers. 


high fidelity phonograph cartridge until 
you ‘ve e read this cad 


oe ees 


SPECIFICATIONS THE MISSING LINK to popular-priced stereophonic sound 


RESPON: 
ELEMENTS: 


ISE: 20-16000 cps. +2.5 db to RIAA has been found: It’s the new Electro-Voice TOTALLY COMPATIBLE 


(Westrex 1A) .5 volt rms. Stereo Cartridge . . . plays the new stereo discs superbly .. . LP’s too... 


OUTPUT: 
COMPLIANCE: 2. x 10*CM/dyne 


even better than existing cartridges. 


By breaking the stereo cartridge cost bottleneck, Electro-Voice has made 
popular-priced quality stereo a reality. E-V’s ceramic stereo cartridge 
(Model 21D with .7 mil diamond stylus) sells for only $19.50 (Audiophile 
net) and is available now at your audio dealer or from your serviceman. 


Here are some of the answers to your questions concerning stereo: 


Q How does the COMPATIBLE E-V Stereo Cartridge 
differ from CONVENTIONAL cartridges? 


A It has the ability to play both the new type stereo- 
phonic discs and conventional records. Inherent in its 
design is an improved monaural performance. Exclusive 
design for rumble suppression of 15 db or better will 
permit the use of Electro-Voice’s Stereo Cartridge with 
any type of changer or transcription player! 

Q Are stereo discs compatible with conventional car- 
tridges? 

A Most cartridges damage the stereo record. DO NOT 
BUY STEREO DISCS UNTIL YOU HAVE AN E-V 
STEREO CARTRIDGE. You may then play monaural 
or stereo discs monaurally. Add a second speaker and 
amplifier, and you have stereophonic sound. 


Q What about modification problems? 


A Using an Electro-Voice Stereo Cartridge, which is 
constructed so that its output is already corrected to 
the RIAA curve, you will not require the equalization 
of the second amplifier. Inserting the cartridge is simple. 
It will fit virtually any standard tone or transcription 
arm. The addition of a second amplifier and speaker is 
not complicated. 


Q What about record availability? 


A Recordings by major record manufacturers will be 
available in mid-1958. 


Q What effect will stereo cartridges and records have 
on your present equipment? 


A Only your cartridge will be obsolete. All other com- 
ponents are compatible with stereo. 


Q What if you don’t have a HI-FI system now . . . should 
you wait? 

A No. Proceed as before—with one exception: you should 
insist on a stereo cartridge initially. When you are ready 
for stereo, merely add a second speaker and amplifier. 


Q How do you go about getting your Electro-Voice 
Stereo Cartridge? 


A Visit your dealer. If you don’t know the name of 
your nearest dealer, please write Electro-Voice. Ask for 
E-V Stereo Model 21 D with .7 mil diamond stylus or 
E-V Stereo Model 26 DST Turnover with .7 mil dia- 
mond Stereo tip and 3 mil sapphire tip for monaural 
78 rpm records ($22.50). 


STERI _ oF IS HERE! 


EREO 


don’t buy an obsolete cartridge... replace with the 
totally compatible Electro-Voice stereo cartridge 


ELECTRO-VOICE, INC. 


fame / 7 ore: oe fe ea... MICHIGAN 


E-V of Canada Ltd., 1908 Avenue Road, Toronto, Ontario 
"19 Bast 40th Street, New York 16, U.S.A. Cables: ARLAB 


World’s leading manufacturer of icsaihiands, Cartridges, High Fidelity Speakers 


and Enclosures, Professional Electronic Instruments and Public Address Speakers. 
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You’re looking at an ant’s-eye view of a diamond 1. An artist of the first rank has given an exceptional 
needle in a record groove. It’s magnified 250 times performance. 

to illustrate the enormous margin for error in the 2. This performance has been flawlessly recorded by 
playback of an ordinary recording. Capitol’s creative staff and sound engineers. 

But the symbol next to it is never put on ordinary 3. It has been judged by the record-rating “Jury” as 
recordings. It reads “Full Dimensional Sound’”’ and being worthy of the “FDS’’ seal—denoting the 
when you see it on the upper right hand corner of a highest fidelity known to the recorder’s art. 
Capitol album you know— No other symbol promises so much. And delivers it. 


Incomparable High Fidelity—Full Dimensional Sound Albums 


CLASSICS. 
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AIRCHILD | 


“THE SOUND OF QUALITY” 
Fairchild —the “First” company is again first with high quality stereo disk playback equipment. 
Fairchild proudly presents a complete Stereo System 
of comparable quality for home or studio. 


282 STEREO ARM. Embodying 
refinements and improvements 
in the world-famous 280 Arm, 
this arm is especially stiitable for 
oA STEREO use. 

* Takes XP-4 and other FAIRCHILD cartridges as plug-in. Uses contact 

blocks for cartridge pins. 

* Takes all other FAIRCHILD cartridges as direct plug-in. Many other 

cartridges easily adaptable with special connectors supplied. 

* Improved spring counter-balance, adjusts easily to stylus force. 

* Improved pivot action, for lower pivot friction and hysteresis. 

* Dual output leads, separate ground lead insulated and isolated. 

* New easy mounting. 

* Lowest tracking error. Carefully designed for optimum performance at 

all stylus positions. 


248 STEREO PREAMP. The 
latest in high quality electronics 
from FAIRCHILD. This unit com- 
bines extreme simplicity of oper- 
ation with equal accuracy of 
equalization and flexibility of 
control. Provides: 

* Two completely independent preamplifiers, with individual bass, treble 
and level adjustments for each channel. 

* Single-knob overall gain control which preserves individual channel 
adjustments and balance. 

* Overall loudness contour control in addition to present balance. 

* Monaural-stereophonic switching which electronically cancels vertical 
cartridge output for use with standard LPs. No improvised switching, no 
compromise playback, no hum or other low switching problems. 

* Tape head input equalized for 15, 742 or 3% ips. Superb quality, S/N 
(measured or audible) better than many professional tape playbacks. 
Perfect for tape stereo as well as disk. 

* Changer input incorporating rumble filter and RIAA equalization. 
Makes system adaptable to any combination of turntables and changers. 
* Full Year Guarantee against any failure due to defects in components 
or workmanship. Premium components and high safety factors used. 


255A POWER AMPLIFIER. With the advent 
of stereo, space will in many cases be at a 
premium. The compact, high performance 
255A will be welcomed as a second channel 
amplifier to convert a monaural system to 
stereo, or in any other application where 
high performance is required. 
* 30 watts RMS (60 watt peak) continuous 
power output at all frequencies from 20 to 20,000 cps. 
* Extreme compactness — size only 92x6"x6%4” high, weight, 17 Ibs. 
* Easily driven by any preamplifier. 0.7 volts input delivers 30 watts out. 
* Extremely stable. Will not oscillate under any input or load conditions. 
Quick recovery from any surge or overl 
* Triode-like quality. Output stage operated to produce minimum higher 
order and odd harmonics. 
* Built for continuous duty. Conservatively rated. 


XP-4 STEREO CARTRIDGE. Utilizing all the 
skill and “know how” of FAIRCHILD crafts- 
manship, this advanced cartridge is of true 
moving coil design and preserves the same 
high standards of performance for which 
FAIRCHILD cartridges have long been noted. 

These features make the XP-4 the best buy in STEREO. 

* Negligible record wear due to very high vertical compliance. 

* No hum loops because of four wire output. Unless the two ground leads 

are insulated from each other, ground loops and annoying hum usually 

result when a stereo caftridge is connected to two preamplifiers. With the 

XP-4, no such trouble need be encountered. 

* Full frequency channel isolation. Separation is more than adequate for 

full stereo effect throughout the entire audio spectrum. 

* Unique pivot construction provides equal freedom of motion in all 

directions in order to follow two-dimensional complex wave forms with 

minimum distortion. Approximately equal mechanical impedance for any 

direction of stylus motion. 

* Specially contoured and formed stylus arm provides minimum weight, 

maximum rigidity for all directions of motion, resulting in greater accuracy 

of motion transfer from stylus to coil, and less distortion. 

* Standard size and weight. Makes adjustment of tone arm simple. Mount- 

ing is easy in either RETMA standard (% inch), or new EIA (%i6 inches) 

mounting centers. 

* Second channel adapter and leads furnished for use in standard arms. 

* May be used for excellent results on standard LPs without change of 

any kind. But for cancellation of vertical response, connection of coils in 

series gives twice the output voltage and excellent vertical rejection. 

* Adequate output. 3 mv each coil, 6 mv if connected in series for 

playing standard LPs. 


FAIRCHILD 412 series TURNTABLES 
with exclusive two stage belt drive. As 
beautiful in appearance as they are in 
performance, these tables are perfect 
for use in the studio or in the home. 
They are ideal for STEREO because 
of their near-zero rumble characteris- 
tics. Any of these turntables exceeds all NARTB specifications for 
professional applications. They are available in 1, 2 and 4-speed 
versions, the latter only in conjunction with the FAIRCHILD ELEC- 
TRONIC DRIVE. All have hysteresis synchronous motors. 


412-1 —one speed (33%) table convertible at any time to 412-4 by the 
addition of the 412 E/D Electronic Control Regulator. Ideal for a majority 
of applications. 


412-2 —two speed (33% and 45) table, utilizing ingenious belt shift 


mechanism. No pucks or rubber wheels to develop flats. The ideal table 
for all of today’s modern recordings. 


412-4 —4 speed unit with variable frequency speed control of hysteresis 
motor. Operates perfectly from any AC power source. Provides four inde- 
pendently controllable speeds and faultless operation. 

Any of these FAIRCHILD components may be mixed with other makes, 
or, used as an integrated system, they provide the best possible trans- 
lation of the recorded groove into loudspeaker performance. They 
produce the Sound of Quality. For further information write: 


FAIRCHILD RECORDING EQUIPMENT CORPORATION « 10-40 45th Avenue, Long Island City 1, N. Y. 


__}-— 
— 


Be sure to mention the JouRNAL in replying to our advertisers. 


This recorder takes on all assignments 


This recorder is an Ampex Multi-Channel. And with it you can handle every audio recording assignment that 
comes your way. 


With this one machine, for example, you can completely control mixing and color of monaural recordings —and 
do this after studio hours. Maintain selective control over echo, equalization and special effects. You can also 
break into the booming stereo market. Open your doors to a new and wide variety of assignments such as 
industrial trade shows, sales meetings, and special events in your area. 


Write for the complete story on how you can make your business more flexible, open doors to new 
sources of income —told in our fully illustrated brochure on Multi-Channel Recording. Covers 
everything from techniques and applications to equipment specifications. Write for your copy. 


AMPEX 


8s2 CHARTER STREET, REDWOOD CcCiTy, CALIFORNIA CORPORATION 


professional 


’ . LLAS, HINGT uC. TROIT, ANCI ON 
NEW YORK, CHICAGO, LOS ANGELES, DALLAS, WASHINGTON D.C., DETRO SAN FRANCISCO products division 
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Be sure to mention the JOURNAL in replying to our advertisers. 


Now Available...The NEW 
Westrex StereoDisk System 


TWO CHANNELS OF IDENTICAL 
SOUND QUALITY IN ONE GROOVE 


The Westrex 3A Feedback Cutter records and The 
Westrex 10A Moving Coil Pickup reproduces 


two 45-degree recordings of identical sound qual- 
ity in a single groove, using a single stylus which 
is actuated by two separate sound channels. 


The Westrex StereoDisk Recorder will also cut 


vertical-lateral stereophonic. 


Provision for Hot-Stylus recording is included. 


For further information contact: 


Westrex Corporation 


111 Eighth Avenue, New York 11, N.Y. * Hollywood Division: 6601 Romaine Street, sistivwouliih, Cole 


The Westrex 
3A Recorder 
with Advance 
Ball and Suc- 
tion Tube. 


NEW HEADS For OLD! 


LANG ELECTRONICS is pleased 
to make available to professional 
users a service whereby they can 
economically extend the useful life 
of recording and playback tape 
recorder head assemblies. 

Heads are refinished by hand 
using a special process which in- 
sures negligible work hardening or 
smearing of the soft pole pieces. 
This is a very delicate operation 


Price 
Schedule 


requiring extremely low cutting 
pressures with selected and condi- 
tioned oil-stones. Final finishing 
is even more gentle and is designed 
to leave the iron unstrained for 
highest efficiency and ratio of 
mechanical to magnetic gap. 

All heads are subject to our in- 
spection to determine desirability 
of reconditioning. Heads recondi- 


AMP E X stereo head assembly 
AM PE half-track head assembly 


AMPE-X full-track assembly 


LANG ELECTRONICS 


507 Fifth Avenue, New York City 


tioned by us can be expected to 
have a useful life equal to or 
greater than that prior to recondi- 
tioning. 

We guarantee that all heads ac- 
cepted by us for reconditioning, 
when tested, will equal or exceed 
standard specifications with respect 
to frequency response and level for 
the particular make and model. 


MUrray Hill 2-7147 
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information for Convention and/or JOURNAL Authors 


SUGGESTIONS FOR PREPARING MANUSCRIPTS, PHOTOGRAPHS, CHARTS, 
DRAWINGS, AND LANTERN SLIDES; LETTERING SIZES, MAILING 


A Journal of original record. Papers orig- 


Society 
Conventions or for publication in the Jour- 
NAL OF THE AES. In rare instances, excep- 
tions may be made by the Convention and 
Publications Committees at their discretion. 

All papers presented at AES Conventions 
are automatically considered for publication 
in the Journat. 

Free and clear. It is assumed that all 
manuscripts submitted to the AES are offered 
free and clear. Any paper accepted for pre- 
sentation before the Society or for publica- 
tion in the JournaL or THE AES becomes the 
exclusive property of the Audio Engineering 
Society. Complete publication rights are held 
by the AES for primary publication in the 
JOURNAL. 

Permission to reprint—in whole or in part 
—papers originally published in the JournaL 
or tHe AES is usually granted freely by the 
Publications Committee upon written request 
and provided the authors agree. 

Multiple bylines. If the paper is to have 
more than one author, the exact form of the 
byline should be indicated on the manuscript 
and will be considered correct by the AES 
as given. 

Vital data. Make sure that the first page 
of your manuscript carries your name pre- 
cisely as you would like it rendered in the 
event of publication. If your business, pro- 
fessional or academic affiliation is to be given, 
this, too, should be included. If your posi- 
tion bears a formal title, you may include it. 

Typing the manuscript. Only one side of 
the sheet should be used. Margins should be 
at least one inch wide on each side. Triple 
spacing is preferred. 

Reviewing of the manuscript is speeded up 
considerably if several copies are submitted. 
(This is helpful but in no sense mandatory.) 
The copies may be clear carbons, mimeo- 
graphed sheets, copies made on a “spirit” 
duplicator, or blueprints. Review copies of 
diagrams, schematics, and graphs may be 
made by any convenient process 

Abstract. The author thould ‘precede his 
text by an abstract summarizing the paper in 
general. The abstract may include a sum- 
mary of observations and conclusions set 
forth. 

Subheads. Subheadings for important sec- 
tions of the paper make it easier to read in 
printed form. 

References. References to periodical litera- 
ture should include the author’s full name, 
exact title of the article or paper cited, full 
name of the publication, volume number, 
page numbers, month, and year. Book refer- 
ences should include the author’s full name, 
full title of the book, the specific page or 
pages referred to, the publisher, place of pub- 
lication, and year of publication. References 
to patent literature should preferably be 


INSTRUCTIONS, ETC. 


given as follows: name, number of the patent 
(U. S. or foreign), date of filing, and date of 
issue. A brief description of the patent is 
helpful. 

Mathematical symbols. Care should be 
taken to make all mathematical expressions 
clear to the printer. All Greek letters and 
any unusual symbols should be identified in 
the margin. Only the very simplest formulas 
should be typewritten: all others should be 
written in carefully in ink. Do not neglect 
to give the meanings of all symbols used. 

Captions for illustrations. A caption— 
properly identified—should be supplied for 
each illustration and a legend for each chart. 
These captions should be listed—in complete 
form and consecutively—on a single sheet of 
paper. 

Photographs. All illustrative material 
needed for a particular manuscript should be 
referred to specifically in the text and should 
accompany the manuscript when it is mailed. 

Photographs and drawings should be pre- 
pared carefully to insure good reproduction. 
Photos should be standard 8 in. x 10 in. glossy 
prints. Since extremely fine detail tends to 
be obscured in all reproduction processes, it 
is often advisable to include a separate pic- 
ture—i.e., a “‘closeup”—of any highly signifi- 
cant detail, in addition to the general view 
which describes the overall field. 


Drawings. The reproduction copies of 
sketches, of curves, or schematics (as distin- 
guished from any review copies submitted) 
should preferably be original drawings in 
India ink on white paper or on tracing paper, 
8% in. x 11 in. Curves made on conventional 
graph paper will reproduce poorly, but black 
India ink tracings, in which only the princi- 
pal cross-section lines are rendered, are satis- 
factory. 

Sharp, high-contrast photographs of line 
drawings are acceptable. 

Please do not send black-and-white lantern 
slides or color transparencies for publication 
—only photographic prints or original draw- 
ings should be submitted. 

Lettering. On the aforementioned 8% in. 
x 11 in. sheets, lettering and numerical data 
should not be less than 0.12 in. high. Neces- 
sary labeling should be lettered unto curves 
and sketches. On the other hand, if extensive 
descriptive material is needed, it is better to 
put such information into the typewritten 
captions rather than to attempt to letter the 
information onto the curve or sketch. 

Mailing. Mail one copy of your manu- 
script to the Convention Chairman. This 
copy is for scheduling and publicity pur- 
poses. Please mail all other copies of your 
manuscript to the Editor. 

Mail that copy of the manuscript which is 
accompanied by the reproduction copies of 
your illustrative material (the Editor’s copy) 
flat, with plenty of stiff cardboard enclosed. 
It is advisable to mark the envelope 
“PLEASE DO NOT BEND.” 


ORAL PRESENTATION OF THE PAPER 

Time allotment. The average time allowed 
for any paper, unless the author requests 
more time on his Author’s Form, is 20 
minutes. If you are not going to deliver 
your paper in person, please supply the name 
of your alternate to the Convention Chair- 
man as early as possible. 
Special oral version. An informal version 
of approximately 20 minutes, having an air 
of spontaneity, is usually more effective than 
a rushed verbatim reading of the manuscript 
exactly as written for publication. Many 
authors, after submitting the formal full- 
length version of their paper for publication, 
prepare an informal version for their guid- 
ance during oral delivery. 

Demonstrations. Demonstrations always 
add interest. They should be set up well in 
advance of the particular session for which 
they are intended and tested under actual 
operating conditions. The Convention Com- 
mittee will cooperate in every way within 
its power. 

Facilities. If you expect to need a black- 
board or any special facilities (electric power, 
large table or tables, etc.) please notify the 


are to be made should be held within the pro- 
portions or overall dimensions of 7 in. x 10 
in., with no more than 20, and preferably 
only 15 words to a chart. Vertical Gothic 
capitals are recommended. 

For a 7 in. x 10 in. working area, the 
smallest desirable letter height is 0.14 in. 
(made with a Leroy pen O or Wrico pen 7). 

Recommended, too, are the following line 
widths: 

For curves—1%4 to 2 points or 0.021 


Thumb marks. To indicate proper orienta- 
tion, a thumb mark should be placed in the 
lower left-hand corner of the slide, when the 
latter is viewed as it will appear projected on 
the screen. 

More complete information on lantern slide 
dimensions is given in American Standard 
Dimensions for Lantern Slides, Z38.7.19-1950. 


ee 
ee : 
inally published elsewhere or promised for ; 
publication elsewhere are not accepted for ; 
Convention Chairman as early as possible. 
Lantern slides. Photographs, diagrams, 
charts, and curves intended to accompany 
oral delivery should be in the form of stand- : 
ard American lantern slides (3% in. x 4 in.) 
or 2 in. x 2 in. transparencies. The long 
dimension of the projected area should be : 
horizontal in the standard slide, but may be : 
either. horizontal or vertical in the 2 in. x 7 
2 in. slide. 
PLEASE NOTE: Opaque projection of 
paper prints or drawings is not recommended. 
Nor can we guarantee that facilities for this : 
type of projection will be provided. Because 
of the dimness of the image, opaque projec- 
tion is unsatisfactory for large audiences. 
Lettering on lantern slides. Charts, dia- 
grams and curves from which lantern slides 
in. to 0.028 in. 
For grid rules—% point or 0.007 in. 
For reference lines—1 point or 0.014 
in. 
? 


SUSTAINING ORGANIZATIONS 


Grateful thanks of the Audio Engineering Society are hereby extended 
to the following organizations which, as sustaining members of the Society, 
help make this publication possible. These organizations are: 


Atec LANsING CorPorATION 

Ampex Avnio, INc., AMPEX CORPORATION 
Auvptio Devices, INc. 

Auvpio 

BritisH INpustRIES CoRPORATION 

Capito. Recorps, INc. 

Co_umBiA Recorps, INc. 

CoMPONENTS CORPORATION 

DicraPHONE CorPOoRATION 

Farrcuitp Recorpinc EquipMENT CoMPANY 
Harvey Rapio Company, Inc. 

Hicu Fme.ity, AupIocRAFT 

INstTITUTE OF HicH Fivettry MANUFACTURERS, INC. 
INTERNATIONAL BustNeEss MACHINES CorPORATION 
James B. Lanstnc Sounpn, INc. 

McIntTosH Laporatory, INc. 

MEASUREMENTS CORPORATION 

OBERLINE, INc. 

PERMOFLUX CORPORATION 

Pickertnc & Company, INc. 

Reeves Sounp Stupios, INc. 

Reeves SOUNDCRAFT CoRPORATION 


Rex-O-Kut Company 
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